SEQUENCE LISTING 



(1) GENERAL INFORMATION:

APPLICANT: el Vlodavsky and Elei

TITLE OF INVENTION: POLYNUCLEOTIDE ENCODING A POLYPEPTIDE HAVING HEPARANASE ACTIVITY AND EXPRESSION OF SAME IN GENETICALLY MODIFIED CELLS

NUMBER OF SEQUENCES:

CORRESPONDENCE ADDRESS:
(A) ADDRESSEE:
(B) STREET:
(C) CITY:
(D) STATE:
(E) COUNTRY: United States
(F) ZIP: 22202

COMPUTER READABLE FORM:
(A) MEDIUM TYPE: 1.44 megabyte, 3.5"
(B) COMPUTER: lead* Slii S90TX
(C) OPERATING SYSTEM: MS DOS version 6.2, Windows version 3.11
(D) SOFTWARE: Word for Windows version 2.0 converted to an ASCII file

CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:

PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: 08/922,170
(B) FILING DATE: 2 SEP 1997
(A) APPLICATION NUMBER: 09/109,386
(B) FILING DATE: 10 JUL 1998
(A) APPLICATION NUMBER: PCT/US98/17954
(B) FILING DATE: 31 AUG 1998

ATTORNEY/AGENT INFORMATION:
(A) NAME:
(B) REGISTRATION NUMBER: 33,883
(C) REFERENCE/DOCKET NUMBER: 910/14

TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 972-3-5625553
(B) TELEFAX: 972-3-5625554
(C) TELEX:

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
CCATCCTAAT ACGACTCACT ATAGGGC 27 



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:
GTAGTGATGC CATGTAACTG AATC 24



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ACTCACTATA GGGCTCGAGC GGC 23 

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
GCATCTTAGC CGTCTTTCTT CG 22 

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
TTTTTTTTTT TTTTT 15 



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TTCGATCCCA AGAAGGAATC AAC 23 

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
: CATGTAACTG AATC 24 



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
Tyr Gly Pro Asp Val Gly Gln Pro Arg

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1721 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CTAGAGCTTT CGACTCTCCG CTGCGCGGCA GCTGGCGGGG GGAGCAGCCA GGTGAGCCCA 60 

AGATGCTGCT GCGCTCGAAG CCTGCGCTGC CGCCGCCGCT GATGCTGCTG CTCCTGGGGC 120 

CGCTGGGTCC CCTCTCCCCT GGCGCCCTGC CCCGACCTGC GCAAGCACAG GACGTCGTGG 180 

ACCTGGACTT cTTCACCCAG GAGCCGCTGC ACCTGGTGAG CCCCTCGTTC CTGTCCGTCA 240 

CCATTGACGC CAACCTGGCC ACGGACCCGC GGTTCCTCAT CCTCCTGGGT TCTCCAAAGC 300 

TTCGTACCTT GGCCAGAGGC TTGTCTCCTG CGTACCTGAG GTTTGGTGGC ACCAAGACAG 360

ACTTCCTAAT TTTCGATCCC AAGAAGGAAT CAACCTTTGA AGAGAGAAGT TACTGGCAAT 420 

CTCAAGTCAA CCAGGATATT TGCAAATATG GATCCATCCC TCCTGATGTG GAGGAGAAGT 480 

TACGGTTGGA ATGGCCCTAC CAGGAGCAAT TGCTACTCCG AGAACACTAC CAGAAAAAGT 540 

TCAAGAACAG CACCTACTCA AGAAGCTCTG TAGATGTGCT ATACACTTTT GCAAACTGCT 600

CAGGACTGGA CTTGATCTTT GGCCTAAATG CGTTATTAAG AACAGCAGAT TTGCAGTGGA 660 

ACAGTTCTAA TGCTCAGTTG CTCCTGGACT ACTGCTCTTC CAAGGGGTAT AACATTTCTT 720 

GGGAACTAGG CAATGAACCT AACAGTTTCC TTAAGAAGGC TGATATTTTC ATCAATGGGT 780 

CGCAGTTAGG AGAAGATTAT ATTCAATTGC ATAAACTTCT AAGAAAGTCC ACCTTCAAAA 840

ATGCAAAACT CTATGGTCCT GATGTTGGTC AGCCTCGAAG AAAGACGGCT AAGATGCTGA 900 

AGAGCTTCCT GAAGGCTGGT GGAGAAGTGA TTGATTCAGT TACATGGCAT CACTACTATT 960 

TGAATGGACG GACTGCTACC AGGGAAGATT TTCTAAACCC TGATGTATTG GACATTTTTA 1020

TTTCATCTGT GCAAAAAGTT TTCCAGGTGG TTGAGAGCAC CAGGCCTGGC AAGAAGGTCT 1080

GGTTAGGAGA AACAAGCTCT GCATATGGAG GCGGAGCGCC CTTGCTATCC GACACCTTTG 1140 

CAGCTGGCTT TATGTGGCTG GATAAATTGG GCCTGTCAGC CCGAATGGGA ATAGAAGTGG 1200 

TGATGAGGCA AGTATTCTTT GGAGCAGGAA ACTACCATTT AGTGGATGAA AACTTCGATC 1260

CTTTACCTGA TTATTGGCTA TCTCTTCTGT TCAAGAAATT GGTGGGCACC AAGGTGTTAA 1320 

TGGCAAGCGT GCAAGGTTCA AAGAGAAGGA AGCTTCGAGT ATACCTTCAT TGCACAAACA 1380 

CTGACAATCC AAGGTATAAA GAAGGAGATT TAACTCTGTA TGCCATAAAC CTCCATAACG 1440

TCACCAAGTA CTTGCGGTTA CCCTATCCTT TTTCTAACAA GCAAGTGGAT AAATACCTTC 1500 

TAAGACCTTT GGGACCTCAT GGATTACTTT CCAAATCTGT CCAACTCAAT GGTCTAACTC 1560 

TAAAGATGGT GGATGATCAA ACCTTGCCAC CTTTAATGGA AAAACCTCTC CGGCCAGGAA 1620 

GTTCACTGGG CTTGCCAGCT TTCTCATATA GTTTTTTTGT GATAAGAAAT GCCAAAGTTG 1680 

CTGCTTGCAT CTGAAAATAA AATATACTAG TCCTGACACT G 1721 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 543

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
Met Leu Leu Arg Ser Lys Pro Ala Leu Pro Pro Pro Leu Met Leu Leu 
5 10 15 

Leu Leu Gly Pro Leu Gly Pro Leu Ser Pro Gly Ala Leu Pro Arg Pro 



Ala Gln Ala Gln Asp Val Val Asp Leu Asp Phe Phe Thr Gln Glu Pro

Leu His Leu Val Ser Pro Ser Phe Leu Ser Val Thr Ile Asp Ala Asn

Leu Ala Thr Asp Pro Arg Phe Leu Ile Leu Leu Gly Ser Pro Lys Leu
65 70 75 80

Arg Thr Leu Ala Arg Gly Leu Ser Pro Ala Tyr Leu Arg Phe Gly Gly 
85 90 95 

Thr Lys Thr Asp Phe Leu Ile Phe Asp Pro Lys Lys Glu Ser Thr Phe
100 105 110

Glu Glu Arg Ser Tyr Trp Gln Ser Gln Val Asn Gln Asp Ile Cys Lys
115 120 125

Tyr Gly Ser Ile Pro Pro Asp Val Glu Glu Lys Leu Arg Leu Glu Trp
130 135 140

Pro Tyr Gln Glu Gln Leu Leu Leu Arg Glu His Tyr Gln Lys Lys Phe
145 150 155 160

Lys Asn Ser Thr Tyr Ser Arg Ser Ser Val Asp Val Leu Tyr Thr Phe
165 170 175

Ala Asn Cys Ser Gly Leu Asp Leu Ile Phe Gly Leu Asn Ala Leu Leu
180 185 190

Arg Thr Ala Asp Leu Gln Trp Asn Ser Ser Asn Ala Gln Leu Leu Leu
195 200 205

Asp Tyr Cys Ser Ser Lys Gly Tyr Asn Ile Ser Trp Glu Leu Gly Asn
210 215 220

Glu Pro Asn Ser Phe Leu Lys Lys Ala Asp Ile Phe Ile Asn Gly Ser
225 230 235 240

Gln Leu Gly Glu Asp Tyr Ile Gln Leu His Lys Leu Leu Arg Lys Ser
245 250 255

Thr Phe Lys Asn Ala Lys Leu Tyr Gly Pro Asp Val Gly Gln Pro Arg
260 265 270 

Arg Lys Thr Ala Lys Met Leu Lys Ser Phe Leu Lys Ala Gly Gly Glu 
275 280 285 



Val Ile Asp Ser Val Thr Trp His His Tyr Tyr Leu Asn Gly Arg Thr
290 295 300

Ala Thr Arg Glu Asp Phe Leu Asn Pro Asp Val Leu Asp Ile Phe Ile
305 310 315 320

Ser Ser Val Gln Lys Val Phe Gln Val Val Glu Ser Thr Arg Pro Gly
325 330 335

Lys Lys Val Trp Leu Gly Glu Thr Ser Ser Ala Tyr Gly Gly Gly Ala
340 345 350

Pro Leu Leu Ser Asp Thr Phe Ala Ala Gly Phe Met Trp Leu Asp Lys
355 360 365

Leu Gly Leu Ser Ala Arg Met Gly Ile Glu Val Val Met Arg Gln Val
370 375 380

Phe Phe Gly Ala Gly Asn Tyr His Leu Val Asp Glu Asn Phe Asp Pro 
385 390 395 400 

Leu Pro Asp Tyr Trp Leu Ser Leu Leu Phe Lys Lys Leu Val Gly Thr 
405 410 415 

Lys Val Leu Met Ala Ser Val Gln Gly Ser Lys Arg Arg Lys Leu Arg
420 425 430

Val Tyr Leu His Cys Thr Asn Thr Asp Asn Pro Arg Tyr Lys Glu Gly
435 440 445

Asp Leu Thr Leu Tyr Ala Ile Asn Leu His Asn Val Thr Lys Tyr Leu
450 455 460

Arg Leu Pro Tyr Pro Phe Ser Asn Lys Gln Val Asp Lys Tyr Leu Leu
465 470 475 480

Arg Pro Leu Gly Pro His Gly Leu Leu Ser Lys Ser Val Gln Leu Asn
485 490 495

Gly Leu Thr Leu Lys Met Val Asp Asp Gln Thr Leu Pro Pro Leu Met
500 505 510

Glu Lys Pro Leu Arg Pro Gly Ser Ser Leu Gly Leu Pro Ala Phe Ser
515 520 525

Tyr Ser Phe Phe Val Ile Arg Asn Ala Lys Val Ala Ala Cys Ile
530 535 540 543 



(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1721

(B) TYPE: nucleic acid

(C) STRANDEDNESS:

(D) TOPOLOGY:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

CT AGA GCT TTC GAC 14



TCT CCG CTG CGC GGC AGC TGG CGG GGG GAG CAG CCA GGT GAG CCC AAG 62 

ATG CTG CTG CGC TCG AAG CCT GCG CTG CCG CCG CCG CTG ATG CTG CTG 110 
Met Leu Leu Arg Ser Lys Pro Ala Leu Pro Pro Pro Leu Met Leu Leu 



CTC CTG GGG CCG CTG GGT CCC CTC TCC CCT GGC GCC CTG CCC CGA CCT 158 
Leu Leu Gly Pro Leu Gly Pro Leu Ser Pro Gly Ala Leu Pro Arg Pro 



GCG CAA GCA CAG GAC GTC GTG GAC CTG GAC TTC TTC ACC CAG GAG CCC 206
Ala Gln Ala Gln Asp Val Val Asp Leu Asp Phe Phe Thr Gln Glu Pro

CTG CAC CTG GTG AGC CCC TCG TTC CTG TCC GTC ACC ATT GAC GCC AAC 254
Leu His Leu Val Ser Pro Ser Phe Leu Ser Val Thr Ile Asp Ala Asn
50 55 60

CTG GCC ACG GAC CCG CGG TTC CTC ATC CTC CTG GGT TCT CCA AAG CTT 302
Leu Ala Thr Asp Pro Arg Phe Leu Ile Leu Leu Gly Ser Pro Lys Leu

CGT ACC TTG GCC AGA GGC TTG TCT CCT GCG TAC CTG AGG TTT GGT GGC 350
Arg Thr Leu Ala Arg Gly Leu Ser Pro Ala Tyr Leu Arg Phe Gly Gly

ACC AAG ACA GAC TTC CTA ATT TTC GAT CCC AAG AAG GAA TCA ACC TTT 398
Thr Lys Thr Asp Phe Leu Ile Phe Asp Pro Lys Lys Glu Ser Thr Phe

GAA GAG AGA AGT TAC TGG CAA TCT CAA GTC AAC CAG GAT ATT TGC AAA 446
Glu Glu Arg Ser Tyr Trp Gln Ser Gln Val Asn Gln Asp Ile Cys Lys

TAT GGA TCC ATC CCT CCT GAT GTG GAG GAG AAG TTA CGG TTG GAA TGG 494
Tyr Gly Ser Ile Pro Pro Asp Val Glu Glu Lys Leu Arg Leu Glu Trp
130 135 140

CCC TAC CAG GAG CAA TTG CTA CTC CGA GAA CAC TAC CAG AAA AAG TTC 542
Pro Tyr Gln Glu Gln Leu Leu Leu Arg Glu His Tyr Gln Lys Lys Phe
145 150 155 160

AAG AAC AGC ACC TAC TCA AGA AGC TCT GTA GAT GTG CTA TAC ACT TTT 590
Lys Asn Ser Thr Tyr Ser Arg Ser Ser Val Asp Val Leu Tyr Thr Phe 



GCA AAC TGC TCA GGA CTG GAC TTG ATC TTT GGC CTA AAT GCG TTA TTA 638
Ala Asn Cys Ser Gly Leu Asp Leu Ile Phe Gly Leu Asn Ala Leu Leu

AGA ACA GCA GAT TTG CAG TGG AAC AGT TCT AAT GCT CAG TTG CTC CTG 686
Arg Thr Ala Asp Leu Gln Trp Asn Ser Ser Asn Ala Gln Leu Leu Leu

GAC TAC TGC TCT TCC AAG GGG TAT AAC ATT TCT TGG GAA CTA GGC AAT 734
Asp Tyr Cys Ser Ser Lys Gly Tyr Asn Ile Ser Trp Glu Leu Gly Asn
210 215 220

GAA CCT AAC AGT TTC CTT AAG AAG GCT GAT ATT TTC ATC AAT GGG TCG 782
Glu Pro Asn Ser Phe Leu Lys Lys Ala Asp Ile Phe Ile Asn Gly Ser
225 230 235 240

CAG TTA GGA GAA GAT TAT ATT CAA TTG CAT AAA CTT CTA AGA AAG TCC 830
Gln Leu Gly Glu Asp Tyr Ile Gln Leu His Lys Leu Leu Arg Lys Ser
245 250 255

ACC TTC AAA AAT GCA AAA CTC TAT GGT CCT GAT GTT GGT CAG CCT CGA 878
Thr Phe Lys Asn Ala Lys Leu Tyr Gly Pro Asp Val Gly Gln Pro Arg
260 265 270

AGA AAG ACG GCT AAG ATG CTG AAG AGC TTC CTG AAG GCT GGT GGA GAA 926
Arg Lys Thr Ala Lys Met Leu Lys Ser Phe Leu Lys Ala Gly Gly Glu

GTG ATT GAT TCA GTT ACA TGG CAT CAC TAC TAT TTG AAT GGA CGG ACT 974
Val Ile Asp Ser Val Thr Trp His His Tyr Tyr Leu Asn Gly Arg Thr
290 295 300

GCT ACC AGG GAA GAT TTT CTA AAC CCT GAT GTA TTG GAC ATT TTT ATT 1022
Ala Thr Arg Glu Asp Phe Leu Asn Pro Asp Val Leu Asp Ile Phe Ile

TCA TCT GTG CAA AAA GTT TTC CAG GTG GTT GAG AGC ACC AGG CCT GGC 1070
Ser Ser Val Gln Lys Val Phe Gln Val Val Glu Ser Thr Arg Pro Gly



AAG AAG GTC TGG TTA GGA GAA ACA AGC TCT GCA TAT GGA GGC GGA GCG 1118 

Lys Lys Val Trp Leu Gly Glu Thr Ser Ser Ala Tyr Gly Gly Gly Ala 

340 345 350 

CCC TTG CTA TCC GAC ACC TTT GCA GCT GGC TTT ATG TGG CTG GAT AAA 1166 

Pro Leu Leu Ser Asp Thr Phe Ala Ala Gly Phe Met Trp Leu Asp Lys 



TTG GGC CTG TCA GCC CGA ATG GGA ATA GAA GTG GTG ATG AGG CAA GTA 1214 
Leu Gly Leu Ser Ala Arg Met Gly Ile Glu Val Val Met Arg Gln Val

TTC TTT GGA GCA GGA AAC TAC CAT TTA GTG GAT GAA AAC TTC GAT CCT 1262
Phe Phe Gly Ala Gly Asn Tyr His Leu Val Asp Glu Asn Phe Asp Pro
385 390 395 400

TTA CCT GAT TAT TGG CTA TCT CTT CTG TTC AAG AAA TTG GTG GGC ACC 1310
Leu Pro Asp Tyr Trp Leu Ser Leu Leu Phe Lys Lys Leu Val Gly Thr

AAG GTG TTA ATG GCA AGC GTG CAA GGT TCA AAG AGA AGG AAG CTT CGA 1358
Lys Val Leu Met Ala Ser Val Gln Gly Ser Lys Arg Arg Lys Leu Arg

GTA TAC CTT CAT TGC ACA AAC ACT GAC AAT CCA AGG TAT AAA GAA GGA 1406
Val Tyr Leu His Cys Thr Asn Thr Asp Asn Pro Arg Tyr Lys Glu Gly

GAT TTA ACT CTG TAT GCC ATA AAC CTC CAT AAC GTC ACC AAG TAC TTG 1454
Asp Leu Thr Leu Tyr Ala Ile Asn Leu His Asn Val Thr Lys Tyr Leu

CGG TTA CCC TAT CCT TTT TCT AAC AAG CAA GTG GAT AAA TAC CTT CTA 1502
Arg Leu Pro Tyr Pro Phe Ser Asn Lys Gln Val Asp Lys Tyr Leu Leu

AGA CCT TTG GGA CCT CAT GGA TTA CTT TCC AAA TCT GTC CAA CTC AAT 1550
Arg Pro Leu Gly Pro His Gly Leu Leu Ser Lys Ser Val Gln Leu Asn

GGT CTA ACT CTA AAG ATG GTG GAT GAT CAA ACC TTG CCA CCT TTA ATG 1598
Gly Leu Thr Leu Lys Met Val Asp Asp Gln Thr Leu Pro Pro Leu Met

GAA AAA CCT CTC CGG CCA GGA AGT TCA CTG GGC TTG CCA GCT TTC TCA 1646
Glu Lys Pro Leu Arg Pro Gly Ser Ser Leu Gly Leu Pro Ala Phe Ser
515 520 525

TAT AGT TTT TTT GTG ATA AGA AAT GCC AAA GTT GCT GCT TGC ATC TGA 1694
Tyr Ser Phe Phe Val Ile Arg Asn Ala Lys Val Ala Ala Cys Ile
530 535 540 543 

AAA TAA AAT ATA CTA GTC CTG ACA CTG 1721 

(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12

CTGGCAAGAA GGTCTGGTTG GGAGAGACGA GCTCAGCTTA CGGTGGCGGT GCACCCTTGC 60 

TGTCCAACAC CTTTGCAGCT GGCTTTATGT GGCTGGATAA ATTGGGCCTG TCAGCCCAGA 120 

TGGGCATAGA AGTCGTGATG AGGCAGGTGT TCTTCGGAGC AGGCAACTAC CACTTAGTGG 180 

ATGAAAACTT TGAGCCTTTA CCTGATTACT GGCTCTCTCT TCTGTTCAAG AAACTGGTAG 240

GTCCCAGGGT GTTACTGTCA AGAGTGAAAG GCCCAGACAG GAGCAAACTC CGAGTGTATC 300 

TCCACTGCAC TAACGTCTAT CACCCACGAT ATCAGGAAGG AGATCTAACT CTGTATGTCC 360

TGAACCTCCA TAATGTCACC AAGCACTTGA AGGTACCGCC TCCGTTGTTC AGGAAACCAG 420 

TGGATACGTA CCTTCTGAAG CCTTCGGGGC CGGATGGATT ACTTTCCAAA TCTGTCCAAC 480 

TGAACGGTCA AATTCTGAAG ATGGTGGATG AGCAGACCCT GCCAGCTTTG ACAGAAAAAC 540 

CTCTCCCCGC AGGAAGTGCA CTAAGCCTGC CTGCCTTTTC CTATGGTTTT TTTGTCATAA 600 

GAAATGCCAA AATCGCTGCT TGTATATGAA AATAAAAGGC ATACGGTACC CCTGAGACAA 660 

AAGCCGAGGG GGGTGTTATT CATAAAACAA AACCCTAGTT TAGGAGGCCA CCTCCTTGCC 720 

GAGTTCCAGA GCTTCGGGAG GGTGGGGTAC ACTTCAGTAT TACATTCAGT GTGGTGTTCT 780

CTCTAAGAAG AATACTGCAG GTGGTGACAG TTAATAGCAC TGTG 824

(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1899

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

GGGAAAGCGA GCAAGGAAGT AGGAGAGAGC CGGGCAGGCG GGGCGGGGTT GGATTGGGAG 60 

CAGTGGGAGG GATGCAGAAG AGGAGTGGGA GGGATGGAGG GCGCAGTGGG AGGGGTGAGG 120

AGGCGTAACG GGGCGGAGGA AAGGAGAAAA GGGCGCTGGG GCTCGGCGGG AGGAAGTGCT 180 

AGAGCTCTCG ACTCTCCGCT GCGCGGCAGC TGGCGGGGGG AGCAGCCAGG TGAGCCCAAG 240 

ATGCTGCTGC GCTCGAAGCC TGCGCTGCCG CCGCCGCTGA TGCTGCTGCT CCTGGGGCCG 300 

CTGGGTCCCC TCTCCCCTGG CGCCCTGCCC CGACCTGCGC AAGCACAGGA CGTCGTGGAC 360 

CTGGACTTCT TCACCCAGGA GCCGCTGCAC CTGGTGAGCC CCTCGTTCCT GTCCGTCACC 420 

ATTGACGCCA ACCTGGCCAC GGACCCGCGG TTCCTCATCC TCCTGGGTTC TCCAAAGCTT 480

CGTACCTTGG CCAGAGGCTT GTCTCCTGCG TACCTGAGGT TTGGTGGCAC CAAGACAGAC 540 

TTCCTAATTT TCGATCCCAA GAAGGAATCA ACCTTTGAAG AGAGAAGTTA CTGGCAATCT 600 

CAAGTCAACC AGGATATTTG CAAATATGGA TCCATCCCTC CTGATGTGGA GGAGAAGTTA 660 

CGGTTGGAAT GGCCCTACCA GGAGCAATTG CTACTCCGAG AACACTACCA GAAAAAGTTC 720 

AAGAACAGCA CCTACTCAAG AAGCTCTGTA GATGTGCTAT ACACTTTTGC AAACTGCTCA 780

GGACTGGACT TGATCTTTGG CCTAAATGCG TTATTAAGAA CAGCAGATTT GCAGTGGAAC 840

AGTTCTAATG CTCAGTTGCT CCTGGACTAC TGCTCTTCCA AGGGGTATAA CATTTCTTGG 900

GAACTAGGCA ATGAACCTAA CAGTTTCCTT AAGAAGGCTG ATATTTTCAT CAATGGGTCG 960

CAGTTAGGAG AAGATTATAT TCAATTGCAT AAACTTCTAA GAAAGTCCAC CTTCAAAAAT 1020 

GCAAAACTCT ATGGTCCTGA TGTTGGTCAG CCTCGAAGAA AGACGGCTAA GATGCTGAAG 1080 

AGCTTCCTGA AGGCTGGTGG AGAAGTGATT GATTCAGTTA CATGGCATCA CTACTATTTG 1140 

AATGGACGGA CTGCTACCAG GGAAGATTTT CTAAACCCTG ATGTATTGGA CATTTTTATT 1200 

TCATCTGTGC AAAAAGTTTT CCAGGTGGTT GAGAGCACCA GGCCTGGCAA GAAGGTCTGG 1260

TTAGGAGAAA CAAGCTCTGC ATATGGAGGC GGAGCGCCCT TGCTATCCGA CACCTTTGCA 1320 

GCTGGCTTTA TGTGGCTGGA TAAATTGGGC CTGTCAGCCC GAATGGGAAT AGAAGTGGTG 1380 

ATGAGGCAAG TATTCTTTGG AGCAGGAAAC TACCATTTAG TGGATGAAAA CTTCGATCCT 1440

TTACCTGATT ATTGGCTATC TCTTCTGTTC AAGAAATTGG TGGGCACCAA GGTGTTAATG 1500 

GCAAGCGTGC AAGGTTCAAA GAGAAGGAAG CTTCGAGTAT ACCTTCATTG CACAAACACT 1560

GACAATCCAA GGTATAAAGA AGGAGATTTA ACTCTGTATG CCATAAACCT CCATAACGTC 1620 

ACCAAGTACT TGCGGTTACC CTATCCTTTT TCTAACAAGC AAGTGGATAA ATACCTTCTA 1680 

AGACCTTTGG GACCTCATGG ATTACTTTCC AAATCTGTCC AACTCAATGG TCTAACTCTA 1740

AAGATGGTGG ATGATCAAAC CTTGCCACCT TTAATGGAAA AACCTCTCCG GCCAGGAAGT 1800

TCACTGGGCT TGCCAGCTTT CTCATATAGT TTTTTTGTGA TAAGAAATGC CAAAGTTGCT 1860

GCTTGCATCT GAAAATAAAA TATACTAGTC CTGACACTG 1899

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 592

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

Met Glu Gly Ala Val Gly Gly Val Arg Arg Arg Asn Gly Ala Glu 

5 10 15 

Glu Arg Arg Lys Gly Arg Trp Gly Ser Ala Gly Gly Ser Ala Arg 

20 25 30 

Ala Leu Asp Ser Pro Leu Arg Gly Ser Trp Arg Gly Glu Gln Pro

35 40 45 

Gly Glu Pro Lys Met Leu Leu Arg Ser Lys Pro Ala Leu Pro Pro 

50 55 60 

Pro Leu Met Leu Leu Leu Leu Gly Pro Leu Gly Pro Leu Ser Pro 

65 70 75 

Gly Ala Leu Pro Arg Pro Ala Gln Ala Gln Asp Val Val Asp Leu

80 85 90

Asp Phe Phe Thr Gln Glu Pro Leu His Leu Val Ser Pro Ser Phe

95 100 105

Leu Ser Val Thr Ile Asp Ala Asn Leu Ala Thr Asp Pro Arg Phe

110 115 120

Leu Ile Leu Leu Gly Ser Pro Lys Leu Arg Thr Leu Ala Arg Gly

125 130 135 

Leu Ser Pro Ala Tyr Leu Arg Phe Gly Gly Thr Lys Thr Asp Phe 

140 145 150 

Leu Ile Phe Asp Pro Lys Lys Glu Ser Thr Phe Glu Glu Arg Ser

155 160 165

Tyr Trp Gln Ser Gln Val Asn Gln Asp Ile Cys Lys Tyr Gly Ser

170 175 180

Ile Pro Pro Asp Val Glu Glu Lys Leu Arg Leu Glu Trp Pro Tyr

185 190 195

Gln Glu Gln Leu Leu Leu Arg Glu His Tyr Gln Lys Lys Phe Lys

200 205 210

Asn Ser Thr Tyr Ser Arg Ser Ser Val Asp Val Leu Tyr Thr Phe

215 220 225

Ala Asn Cys Ser Gly Leu Asp Leu Ile Phe Gly Leu Asn Ala Leu

230 235 240

Leu Arg Thr Ala Asp Leu Gln Trp Asn Ser Ser Asn Ala Gln Leu

245 250 255

Leu Leu Asp Tyr Cys Ser Ser Lys Gly Tyr Asn Ile Ser Trp Glu

260 265 270

Leu Gly Asn Glu Pro Asn Ser Phe Leu Lys Lys Ala Asp Ile Phe

275 280 285

Ile Asn Gly Ser Gln Leu Gly Glu Asp Tyr Ile Gln Leu His Lys

290 295 300 

Leu Leu Arg Lys Ser Thr Phe Lys Asn Ala Lys Leu Tyr Gly Pro 

305 310 315 

Asp Val Gly Gln Pro Arg Arg Lys Thr Ala Lys Met Leu Lys Ser

320 325 330

Phe Leu Lys Ala Gly Gly Glu Val Ile Asp Ser Val Thr Trp His

335 340 345

His Tyr Tyr Leu Asn Gly Arg Thr Ala Thr Arg Glu Asp Phe Leu

350 355 360

Asn Pro Asp Val Leu Asp Ile Phe Ile Ser Ser Val Gln Lys Val

365 370 375

Phe Gln Val Val Glu Ser Thr Arg Pro Gly Lys Lys Val Trp Leu

380 385 390 

Gly Glu Thr Ser Ser Ala Tyr Gly Gly Gly Ala Pro Leu Leu Ser 

395 400 405 

Asp Thr Phe Ala Ala Gly Phe Met Trp Leu Asp Lys Leu Gly Leu 

410 415 420 

Ser Ala Arg Met Gly Ile Glu Val Val Met Arg Gln Val Phe Phe

425 430 435 

Gly Ala Gly Asn Tyr His Leu Val Asp Glu Asn Phe Asp Pro Leu 

440 445 450 

Pro Asp Tyr Trp Leu Ser Leu Leu Phe Lys Lys Leu Val Gly Thr 

455 460 465 

Lys Val Leu Met Ala Ser Val Gln Gly Ser Lys Arg Arg Lys Leu

470 475 480

Arg Val Tyr Leu His Cys Thr Asn Thr Asp Asn Pro Arg Tyr Lys

485 490 495

Glu Gly Asp Leu Thr Leu Tyr Ala Ile Asn Leu His Asn Val Thr

500 505 510

Lys Tyr Leu Arg Leu Pro Tyr Pro Phe Ser Asn Lys Gln Val Asp

515 520 525

Lys Tyr Leu Leu Arg Pro Leu Gly Pro His Gly Leu Leu Ser Lys

530 535 540

Ser Val Gln Leu Asn Gly Leu Thr Leu Lys Met Val Asp Asp Gln

545 550 555

Thr Leu Pro Pro Leu Met Glu Lys Pro Leu Arg Pro Gly Ser Ser

560 565 570

Leu Gly Leu Pro Ala Phe Ser Tyr Ser Phe Phe Val Ile Arg Asn

575 580 585

Ala Lys Val Ala Ala Cys Ile

590 592 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1899 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15
GGG 3

AAA GCG AGC AAG GAA GTA GGA GAG AGC CGG GCA GGC GGG GCG GGG 48
TTG GAT TGG GAG CAG TGG GAG GGA TGC AGA AGA GGA GTG GGA GGG 93
ATG GAG GGC GCA GTG GGA GGG GTG AGG AGG CGT AAC GGG GCG GAG 138 
Met Glu Gly Ala Val Gly Gly Val Arg Arg Arg Asn Gly Ala Glu 
5 10 15 

GAA AGG AGA AAA GGG CGC TGG GGC TCG GCG GGA GGA AGT GCT AGA 183 
Glu Arg Arg Lys Gly Arg Trp Gly Ser Ala Gly Gly Ser Ala Arg 
20 25 30 

GCT CTC GAC TCT CCG CTG CGC GGC AGC TGG CGG GGG GAG CAG CCA 228 
Ala Leu Asp Ser Pro Leu Arg Gly Ser Trp Arg Gly Glu Gln Pro
35 40 45 

GGT GAG CCC AAG ATG CTG CTG CGC TCG AAG CCT GCG CTG CCG CCG 273 
Gly Glu Pro Lys Met Leu Leu Arg Ser Lys Pro Ala Leu Pro Pro 
50 55 60 

CCG CTG ATG CTG CTG CTC CTG GGG CCG CTG GGT CCC CTc TCC CCT 318 

Pro Leu Met Leu Leu Leu Leu Gly Pro Leu Gly Pro Leu Ser Pro 
65 70 75 

GGC GCC CTG CCC CGA CCT GCG CAA GCA CAG GAC GTc GTG GAC CTG 363
Gly Ala Leu Pro Arg Pro Ala Gln Ala Gln Asp Val Val Asp Leu
80 85 90

GAC TTc TTC ACC CAG GAG CCG CTG CAC CTG GTG AGC CCC TCG TTC 408
Asp Phe Phe Thr Gln Glu Pro Leu His Leu Val Ser Pro Ser Phe
95 100 105

CTG TCC GTC ACC ATT GAC GCC AAC CTG GCC ACG GAC CCG CGG TTC 453
Leu Ser Val Thr Ile Asp Ala Asn Leu Ala Thr Asp Pro Arg Phe
110 115 120

CTC ATC CTC CTG GGT TCT CCA AAG CTT CGT ACC TTG GCC AGA GGC 498
Leu Ile Leu Leu Gly Ser Pro Lys Leu Arg Thr Leu Ala Arg Gly
125 130 135 

TTG TCT CCT GCG TAC CTG AGG TTT GGT GGC ACC AAG ACA GAC TTC 543 
Leu Ser Pro Ala Tyr Leu Arg Phe Gly Gly Thr Lys Thr Asp Phe 
140 145 150 

CTA ATT TTC GAT CCC AAG AAG GAA TCA ACC TTT GAA GAG AGA AGT 588 
Leu Ile Phe Asp Pro Lys Lys Glu Ser Thr Phe Glu Glu Arg Ser
155 160 165

TAC TGG CAA TCT CAA GTC AAC CAG GAT ATT TGC AAA TAT GGA TCC 633
Tyr Trp Gln Ser Gln Val Asn Gln Asp Ile Cys Lys Tyr Gly Ser
170 175 180

ATC CCT CCT GAT GTG GAG GAG AAG TTA CGG TTG GAA TGG CCC TAC 678
Ile Pro Pro Asp Val Glu Glu Lys Leu Arg Leu Glu Trp Pro Tyr
185 190 195 

CAG GAG CAA TTG CTA CTC CGA GAA CAC TAC CAG AAA AAG TTC AAG 723
Gln Glu Gln Leu Leu Leu Arg Glu His Tyr Gln Lys Lys Phe Lys
200 205 210

AAC AGC ACC TAC TCA AGA AGC TCT GTA GAT GTG CTA TAC ACT TTT 768
Asn Ser Thr Tyr Ser Arg Ser Ser Val Asp Val Leu Tyr Thr Phe
215 220 225

GCA AAC TGC TCA GGA CTG GAC TTG ATC TTT GGC CTA AAT GCG TTA 813
Ala Asn Cys Ser Gly Leu Asp Leu Ile Phe Gly Leu Asn Ala Leu
230 235 240

TTA AGA ACA GCA GAT TTG CAG TGG AAC AGT TCT AAT GCT CAG TTG 858
Leu Arg Thr Ala Asp Leu Gln Trp Asn Ser Ser Asn Ala Gln Leu
245 250 255

CTC CTG GAC TAC TGC TCT TCC AAG GGG TAT AAC ATT TCT TGG GAA 903
Leu Leu Asp Tyr Cys Ser Ser Lys Gly Tyr Asn Ile Ser Trp Glu
260 265 270

CTA GGC AAT GAA CCT AAC AGT TTC CTT AAG AAG GCT GAT ATT TTC 948
Leu Gly Asn Glu Pro Asn Ser Phe Leu Lys Lys Ala Asp Ile Phe
275 280 285

ATC AAT GGG TCG CAG TTA GGA GAA GAT TAT ATT CAA TTG CAT AAA 993
Ile Asn Gly Ser Gln Leu Gly Glu Asp Tyr Ile Gln Leu His Lys
290 295 300

CTT CTA AGA AAG TCC ACC TTC AAA AAT GCA AAA CTC TAT GGT CCT 1038
Leu Leu Arg Lys Ser Thr Phe Lys Asn Ala Lys Leu Tyr Gly Pro
305 310 315

GAT GTT GGT CAG CCT CGA AGA AAG ACG GCT AAG ATG CTG AAG AGC 1083
Asp Val Gly Gln Pro Arg Arg Lys Thr Ala Lys Met Leu Lys Ser
320 325 330

TTC CTG AAG GCT GGT GGA GAA GTG ATT GAT TCA GTT ACA TGG CAT 1128
Phe Leu Lys Ala Gly Gly Glu Val Ile Asp Ser Val Thr Trp His
335 340 345

CAC TAC TAT TTG AAT GGA CGG ACT GCT ACC AGG GAA GAT TTT CTA 1173
His Tyr Tyr Leu Asn Gly Arg Thr Ala Thr Arg Glu Asp Phe Leu
350 355 360

AAC CCT GAT GTA TTG GAC ATT TTT ATT TCA TCT GTG CAA AAA GTT 1218
Asn Pro Asp Val Leu Asp Ile Phe Ile Ser Ser Val Gln Lys Val
365 370 375

TTC CAG GTG GTT GAG AGC ACC AGG CCT GGC AAG AAG GTC TGG TTA 1263
Phe Gln Val Val Glu Ser Thr Arg Pro Gly Lys Lys Val Trp Leu
380 385 390

GGA GAA ACA AGC TCT GCA TAT GGA GGC GGA GCG CCC TTG CTA TCC 1308
Gly Glu Thr Ser Ser Ala Tyr Gly Gly Gly Ala Pro Leu Leu Ser
395 400 405

GAC ACC TTT GCA GCT GGC TTT ATG TGG CTG GAT AAA TTG GGC CTG 1353
Asp Thr Phe Ala Ala Gly Phe Met Trp Leu Asp Lys Leu Gly Leu
410 415 420



TCA GCC CGA ATG GGA ATA gAA GTG GTG ATG AGG CAA GTA TTC TTT 1398 
Ser Ala Arg Met Gly Ile Glu Val Val Met Arg Gln Val Phe Phe
425 430 435 

GGA GCA GGA AAC TAC CAT TTA GTG GAT GAA AAC TTC GAT CCT TTA 1443 
Gly Ala Gly Asn Tyr His Leu Val Asp Glu Asn Phe Asp Pro Leu 
440 445 450 

CCT GAT TAT TGG CTA TCT CTT CTG TTC AAG AAA TTG GTG GGC ACC 1488
Pro Asp Tyr Trp Leu Ser Leu Leu Phe Lys Lys Leu Val Gly Thr
455 460 465

AAG GTG TTA ATG GCA AGC GTG CAA GGT TCA AAG AGA AGG AAG CTT 1533
Lys Val Leu Met Ala Ser Val Gln Gly Ser Lys Arg Arg Lys Leu
470 475 480 

CGA GTA TAC CTT CAT TGC ACA AAC ACT GAC AAT CCA AGG TAT AAA 1578 
Arg Val Tyr Leu His Cys Thr Asn Thr Asp Asn Pro Arg Tyr Lys 
485 490 495 

GAA GGA GAT TTA ACT CTG TAT GCC ATA AAC CTC CAT AAC GTC ACC 1623 
Glu Gly Asp Leu Thr Leu Tyr Ala Ile Asn Leu His Asn Val Thr
500 505 510

AAG TAC TTG CGG TTA CCC TAT CCT TTT TCT AAC AAG CAA GTG GAT 1668
Lys Tyr Leu Arg Leu Pro Tyr Pro Phe Ser Asn Lys Gln Val Asp
515 520 525 

AAA TAC CTT CTA AGA CCT TTG GGA CCT CAT GGA TTA CTT TCC AAA 1713 
Lys Tyr Leu Leu Arg Pro Leu Gly Pro His Gly Leu Leu Ser Lys 
530 535 540 

TCT GTC CAA CTC AAT GGT CTA ACT CTA AAG ATG GTG GAT GAT CAA 1758 

Ser Val Gln Leu Asn Gly Leu Thr Leu Lys Met Val Asp Asp Gln

545 550 555 

ACC TTG CCA CCT TTA ATG GAA AAA CCT CTC CGG CCA GGA AGT TCA 1803 
Thr Leu Pro Pro Leu Met Glu Lys Pro Leu Arg Pro Gly Ser Ser 
560 565 570 

CTG GGC TTG CCA GCT TTC TCA TAT AGT TTT TTT GTG ATA AGA AAT 1848 
Leu Gly Leu Pro Ala Phe Ser Tyr Ser Phe Phe Val Ile Arg Asn
575 580 585

GCC AAA GTT GCT GCT TGC ATC TGA AAA TAA AAT ATA CTA GTC CTG 1893
Ala Lys Val Ala Ala Cys Ile
590 592 

ACA CTG 1899 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 594 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16

ATTACTATAG GGCACGCGTG GTCGACGGCC CGGGCTGGTA TTGTCTTAAT GAGAAGTTGA 60

TAAAGAATTT TGGGTGGTTG ATCTCTTTCC AGCTGCAGTT TAGCGTATGC TGAGGCCAGA 120

TTTTTTCAGG CAAAAGTAAA ATACCTGAGA AACTGCCTGG CCAGAGGACA ATCAGATTTT 180

GGCTGGCTCA AGTGACAAGC AAGTGTTTAT AAGCTAGATG GGAGAGGAAG GGATGAATAC 240

TCCATTGGAG GCTTTACTCG AGGGTCAGAG GGATACCCGG CGCCATCAGA ATGGGATCTG 300

GGAGTCGGAA ACGCTGGGTT CCCACGAGAG CGCGCAGAAC ACGTGCGTCA GGAAGCCTGG 360

TCCGGGATGC CCAGCGCTGC TCCCCGGGCG CTCCTCCCCG GGCGCTCCTC CCCAGGCCTC 420

CCGGGCGCTT GGATCCCGGC CATCTCCGCA CCCTTCAAGT GGGTGTGGGT GATTTCGTAA 480

GTGAACGTGA CCGCCACCGG GGGGAAAGCG AGCAAGGAAG TAGGAGAGAG CCGGGCAGGC 540

GGGGCGGGGT TGGATTGGGA GCAGTGGGAG GGATGCAGAA GAGGAGTGGG AGGG 594



(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17 
CCCCAGGAGC AGCAGCATCA G 21 



(2) INFORMATION FOR SEQ ID NO: 18:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
AGGCTTCGAG CGCAGCAGCA T 21 



(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19

GTAATACGAC TCACTATAGG GC 22 



(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
ACTATAGGGC ACGCGTGGT 19 



(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
CTTGGGCTCA CCTGGCTGCT C 21

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22

AGCTCTGTAG ATGTGCTATA CAC 23 

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23

GCATCTTAGC CGTCTTTCTT CG 22 

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
GAGCAGCCAG GTGAGCCCAA GAT 23 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
TTCGATCCCA AGAAGGAATC AAC 23 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26 
AGCTCTGTAG ATGTGCTATA CAC 23 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
TCAGATGCAA GCAGCAACTT TGGC 24 



(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 22 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28 
GCATCTTAGC CGTCTTTCTT CG 22 

(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
GTAGTGATGC CATGTAACTG AATC 24

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
AGGCACCCTA GAGATGTTCC AG 22 



(2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
GAAGATTTCT GTTTCCATGA CGTG 24 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32 
CCACACTGAA TGTAATACTG AAGTG 25 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22

(B) TYPE: nucleic acid

(C) STRANDEDNESS:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 
CGAAGCTCTG GAACTCGGCA AG 22 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 
GCCAGCTGCA AAGGTGTTGG AC 22 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 
AACACCTGCC TCATCACGAC TTC 23 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36
GCCAGGCTGG CGTCGATGGT GA 22 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 
GTCGATGGTG ATGGACAGGA AC 22 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38
GTAATACGAC TCACTATAGG GC 22 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39 
ACTATAGGGC ACGCGTGGT 19 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40 
CCATCCTAAT ACGACTCACT ATAGGGC 27 
(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23

(B) TYPE: nucleic acid



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41 
ACTCACTATA GGGCTCGAGC GGC 23 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 44£ 



(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 

GGATCTTGGC TCACTGCAAT CTCTGCCTCC CATGCAATTC TTATGCATCA 50 

GCCTCCTGAG TAGCTTGGAT TATAGGTCTG CGCCACCACT CCTGGCTACA 100 

CCATGTTGCC CAGGCTGGTC TTGAACTCTT GGGCTCTAGT GATCCACCCG 150 

CCTTGGCCTC CCAAAGTGCT GGGATTACAG GTGTGAGCCA TCACACCCGG 200 

CCCCCCGTTT CCATATTAGT AACTCACATG TAGACCACAA GGATGCACTA 250 

TTTAGAAAAC TTGCAATGGT CCACTTTTCA AATCACCCAA ACATGTTAAA 300 

GAAATTGGTA TGACTGGGCA TGGCACAGTG GCTCATGCCT GCAATCCTAG 350 

CATTTTGTGA GGCTGAGACG GGCAGATCAC GAGGTCAGGA GATTGAGACC 400 

ATCCTGACAG ACATGGTGAA ATCCCATCTC TACTAAAAAT ACAAAACAAT 450 

TAGCCGGGGG TGATGGCAGG CCCCTGTAGT CCCAGCTACT CGGGAGGCTG 500 

AGGCAGGAGA ATGGCGTGAA TCCAGGAGGC AGAGCTTGCA GTGAGCCGAG 550 

ATGGTGCCAC TGCACTCCAG CCTGGGCGAC AGAGCGAGAC TCCGTCTCAA 600 

AAAAAAAAAA AAAGAAAGAA ATTGGTATGA CTGTTGACTC ACAACAGGAG 650

TCAGGGGCAT GGGGTGGGGT GTAAGATTAA TGTCATGACA AATGTGGAAA 700 

AGAAACTTCT GTTTTTCCAA CTCCACGTCT GCTACCATAT TATTACACTC 750 

TTCTGGTAGT GTGGTGTTTA TGTGTGAATT TTTTTTCATA TGTATACAGT 800 

AATTGTAGGA TATGAACCTG ATTCTAGTTG CAAAACTCAC TATGAGCTTA 850 

GCTTTTAAGT TGCTTAAGAA TAGGTAGATC TATGCAAATA ATGATAATTA 900 

TTATTATTAT TTTAAGAGAG GGTCTCACTT TGTCACCCAG GCTGGAGTGC 950 

AGTGGTGTGA TTAAGGGTCA CTGCAACCTC CACCTCCCAG GCTCAAATAA 1000 

ACCTCCCACC TCAGCCTCCC CAGTAGCTGG AACCACAGGC ACGGGCCACC 1050 

ACGCCTGGCT AATTTTTTGT ATTTTTTGTA GAGATGGGGT TTCATCATGT 1100 

TGCCCAGGCT GTTCTTGAAT TCCTCGGCTC AAGCAATCCT CCCACCTTGG 1150 

CCTCCCAAAA TGCTGGCATC ACAGGCATGA TGGCATCACT GGCATCACAT 1200 

ACCATGCCTG GCCTGATTTA TGCAAATTAG ATATGCATTT CAAAATAATC 1250 

TATTTTTATT TGTTGCCTTA TTGGTGGTAC AATCTCAAGT GGAAAAATCT 1300

AAGGGTTTTG GTGTTATTTG CTTACTCAAC CAATATTTAT TAGACTCTTA 1350 

CTAAGCACCA ACATGATCAC ATGCCTGAGC TATGGCTAGC ATAGCGTGTG 1400 

AGACAAACTT AATCTCTGTT TTGGTGGAGC ATATAATCTA GTAGATGAAG 1450

CCAATGTTGA GCAACATCAC AATACTAACA AATTGAGGAT GCTACGAGAG 1500 

TGTCTAACAA ATTGAGGATG CTACGAGAGT GTCTAACAAA TTGAGGATGC 1550 

TATGAGAGTG TGTCATGGAG AGCTGCCTGG AGATTGAGAG AAAGCTTCCT 1600 

TGAGGGAAGT TACATTTCAG CTGAAACACA CTGCCATCTG CTCGAGGTTT 1650 

TGTAACTGCA TTCACATCCC GATTCTGACA CTTCACATCC CGATTCTGAC 1700 

ACTTCACCCA GTTACTGTCT CAGAGCTTGG GTCCGCATGT GTAAAACAAG 1750 

GACAGTATGC ACTTGGCAGG GTTGTGAGAA GGGAAGAGAA CACAAGTAAA 1800 

GCACCTGTAT CAGGCATACA GTAGGCACTA AGCGTGCGAT GCTTGCTATG 1850 

ATTATACATC AGTGTAAGCA TCAAGGAAAA GCTGAAGAAA AGTCTGACCA 1900

ACAGCGAAAG ATAAATGCGC AGAGGAGAAA TTTGGCAAAG GCTCCAAATT 1950 

CAGGGGCAGT CCGTACTCTA CACTTTGTAT GGGGGCTTCA GGTCCTGAGT 2000 

TCCAGACATT GGAGCAACTA ACCCTTTAAG ATTGCTAAAT ATTGTCTTAA 2050 

TGAGAAGTTG ATAAAGAATT TTGGGTGGTT GATCTCTTTC CAGCTGCAGT 2100

TTAGCGTATG CTGAGGCCAG ATTTTTTCAA GCAAAAGTAA AATACCTGAG 2150 

AAACTGCCTG GCCAGAGGAC AATCAGATTT TGGCTGGCTC AAGTGACAAG 2200 

CAAGTGTTTA TAAGCTAGAT GGGAGAGGAA GGGATGAATA CTCCATTGGA 2250 

GGTTTTACTC GAGGGTCAGA GGGATACCCG GCGCCATCAG AATGGGATCT 2300 

GGGAGTCGGA AACGCTGGGT TCCCACGAGA GCGCGCAGAA CACGTGCGTC 2350 

AGGAAGCCTG GTCCGGGATG CCCAGCGCTG CTCCCCGGGC GCTCCTCCCC 2400

GGGCGCTCCT CCCCAGGCCT CCCGGGCGCT TGGATCCCGG CCATCTCCGC 2450

ACCCTTCAAG TGGGTGTGGG TGATTTCGTA AGTGAACGTG ACCGCCACCG 2500 

AGGGGAAAGC GAGCAAGGAA GTAGGAGAGA GCCGGGCAGG CGGGGCGGGG 2550 

TTGGATTGGG AGCAGTGGGA GGGATGCAGA AGAGGAGTGG GAGGGATGGA 2600 

GGGCGCAGTG GGAGGGGTGA GGAGGCGTAA CGGGGCGGAG GAAAGGAGAA 2650

AAGGGCGCTG GGGCTCGGCG GGAGGAAGTG CTAGAGCTCT CGACTCTCCG 2700 

CTGCGCGGCA GCTGGCGGGG GGAGCAGCCA GGTGAGCCCA AGATGCTGCT 2750 
GCGCTCGAAG CCTGCGCTGC CGCCGCCGCT GATGCTGCTG CTCCTGGGGC 2800

CGCTGGGTCC CCTCTCCCCT GGCGCCCTGC CCCGACCTGC GCAAGCACAG 2850 

GACGTCGTGG ACCTGGACTT CTTCACCCAG GAGCCGCTGC ACCTGGTGAG 2900 

CCCCTCGTTC CTGTCCGTCA CCATTGACGC CAACCTGGCC ACGGACCCGC 2950 

GGTTCCTCAT CCTCCTGGGG TAAGCGCCAG CCTCCTGGTC CTGTCCCCTT 3000 

TCCTGTCCTC CTGACACCTA TGTCTGCCCC GCCAGCGGCT CTCCTTCTTT 3050 

TGCGCGGAAA CAACTTCACA CCGGAACCTC CCCGCCTGTC TCTCCCCACC 3100 

CCACTTCCCG CCTCTCATTC TCCCTCTCCC TCCCTTACTC TCAGACCCCA 3150 

AACCGCTTTT TGGGGGGTAT CATTTAAAAA ATAGATTTAG GGGTTACAAG 3200 

TGCAGTTCTG TTCCATGGGT ATATTGCATT GTGGTGGCAT CTGGGCTCTT 3250 

AGTGTAACTG TCACCCGAAT GTTGTACATT GTATCTAATA GGTAATTTCT 3300 

CATCCCTCAT CCCTCTCCCA CCCTCCCACC TTTTGGAGTC TCCAGTGTCT 3350 

ACTATTCCAC TAAGTCCATG TGTACACATT GTTTAGCGCC CACTCTAAAT 3400 

GAGCCTTTTT GTTTCATTCA TTCTGTAAGT GTTGAATAGG CACCACCTAA 3450 

GGTCAGGTAT AAGTGGAAAT TTGAAAAAGA AACTGCCCAC TTGCCCCAGT 3500 

ACTTCCCTAG CCAAGAGGAG GGAAACCAGG CAGGTGCACC TGAAGGCCTG 3550 

TGAGTGCTTG ATTTGCTGTG CAGTGTAGGA CAAGTAAGAT TGTGCATAGC 3600 

CTTCTGTATT TAAGACTGTG TTAGGAAGAT TTCTCTTTCT TTTCTTTTCT 3650 

TTTTCTTTTT TCTTTTCTTT TTTTTTTTTA GGCAGATGAA AAGGGCGTCA 3700 

CAGAACAGGA ATAAAAATCT AAATATTCAA TAAATGAGAC CTAGGAGACT 3750 

ACTGCAGTGA CTTACAAAGT CCTAATAAAA AGATGTCTCT CCAAAATGGG 3800 

GCTGCAAAAT GTGGTGCTGC CTTATCAGCT CTAAGTTTTT TCCTTACCTG 3850 

AGAAAGAAGG AACCTGATGC AGGTTCAGGG CTCCTGCCCC ATGAATGCAG 3900 

GCTGACTCCA AGATGGGGAG CTACAGGGAC AATCCCAGGT CTTCTAGGCC 3950 

TCTTATTTAG GCCCTGGGAG CCTCCAGAGA TGGCCACATC TTGACCAGCC 4000 

CAGATAGAGG GAAAGATCAC CATTATCTCA CCTCTGTGTC AAATACCTAG 4050 

ATGCTGTCCT CCCTGAGCCC ACACTATAGT TGCCAGCGCT AATTTAATGG 4100 

GTAGTGTACT GGTTAAGAGA TGGACAGACC ATCCTGGCTT GACTCTCAGC 4150 

TCTGGCAAAG ATGAGTGACT TGGTTTTTCC ATATCTCTTG GCCACACCAA 4200 

CCTTGATTTC TTCAGCTGTA GAATGGAATT TCTCAAGCTT GCCTCAAGGA 4250 

TTATTGCCCG AGGATTTGAT GATATGGTAA GAGCTTCTCA GTGTTTGACC 4300 

CATAGTAAGT GTTTGACGTT TCAAACGAAT TGTTTCTTTC TAGGACATGG 4350 

TGAGCATTTG GTAGCCATTC ACCGGTTTTC TGTTTCTTTG GATCATAGTT 4400 

AACCTCTCCT TTTCCTTCTG GCACTACAAT TTTCTGGTGG GGAAGAATCC 4450 

TTACTTTCTG CCCTTCCCCT TAAGGATAGG AAGCTGATAC TAGGCAGCAA 4500 

CTAGTTGGGG GATAGGAAGA TTGTTCCAGA GAAATGCTGA ACCATAGGGC 4550 

TCCAGATCAC AGGACCCCAG TCTTAGCTTG CTGGGGTGTG GGGTGGGGGG 4600

GGGCGGTTAC TGAACATGGG TATGAAGTAG ATGTCCATTT ACTGAAATGT 4650

GAGGACCTGA GGCCTCTTCT ATTGCTGTAG CCAGCATATT CCCCAACCTC 4700 

TCCCCAAGAA AGGACAGATG GGGGTTCCCC CCTGGAGTAA CAGGTCCAAA 4750 

AGAAAAAACA TACAGTGGGA CTTCCAGGAT CTGGGCCTGA TCACCCAGCA 4800 

GTCAAGCTCC CCGCAATTGA CTAACACCCC CCTAACACGT AGAAATTCCA 4850 

ATCTGCAATT TAGTGAGGAT GATACCTTTA TTCTTCTTAA ATACATCTCT 4900

TCATTTCCCA GAGCACCCTT TTTTCCCCTC CTCTGCACCT TTTTGTTAAA 4950

GACTGGAGTA TAATGAAATA CCAAGAGAGC ATAACATGTG ATACATAAAA 5000 

CTTTTTTTCT GGTTTACAAA ACAGTTCATT CTTGTCCATA CGTGCTTCTC 5050

TCCAAGGCTG GCTGCTGTCT GTTCCAGCCC GCTTCGCTTG GAGAGGCCAT 5100 

CTGCCATACC TGCTCCCCAG ACGCATCGAC AAGCACACCC AGAGTGTTAT 5150 

CTGCTAAGAC CTAAAAGAGG GAGGAACCCC CTCTCCTCAT CTAAGACCTA 5200 

GCTTCTAAAT TAGAGTGTGA GGGTCCATCT CCCCAGGAGG GGCACAGGGC 5250 

CCAAACAGCC CAGCCATCTC AGAAGACAAC ACTAAGCTTT GTAGGGGTCC 5300 

ACAGTAGAGG AGAGTAAGAC GCCTGTTGTT TAATTTATTA CAGTTCCTCA 5350 

AAAGTGAAGA TGTGTGGGCG GGATGGCAAG AGCTGAGCAG ACGAAAGCTG 5400 

AAGGAATAAG GAAAGAGAGG AGGACACAAA CAGCTGACAC TTCCTCAGTT 5450 

CTTGTCATTT GCCTGGCCCT GTTCTAAGCA CCTTCTAGGT ATTAATCCAT 5500 

TTAGTCTTGG CTACAACACT GTGAGTAACT AGTTTTGTCA CCCCCATTTT 5550 

AAAAATGAAG AAAGTGAGGC TCAGGGAGGT TAAGTAACTT GGCCACAGTT 5600 

TGAAACTAGA CTCTGATCAC ATGAGATAAT AGTGCCCATA AAAAGGGAAA 5650 

GCAGATTATA TTTTTTAAAG GAAAGAGAGT AGGATATGGT AGAAAAAGAT 5700 

TGTTTGGAAA GGAATTGAGA GATTGATATA ATGAAAAGAA GCATTCACAT 5750 

GAGAGTAACA GTATCAGGGC CCAAACCTTC ATCTAAGGTA CTTCAAAGAG 5800 

GCCTAAGCAA ACTTAGTCAC TGGCGTGGTT CTAGTCTCCA TGATGGCAAA 5850 

TACATTGTGT ACAGCCCAAC TCCACACAAA ACTTAAATAC CAATGATAGA 5900 

GCAATCTAAA ATTTGAAAGA AAAAATCTTT CAATTTGTCG TCTTCCCAGA 5950 

GGGACTTAAT CAAGAAACCA ATCAAAATAC TTCCTAAGCC TAACTGTGTG 6000 

CAGAACTCCA AAGAGAGCCC AGCCCTAAAT CAACACTGTC CAATGGAAAT 6050 

ATAATATAAT GTGGGCCTCA TATGCAAGGT CATATGTAAT TTTAAATTTT 6100 

CTAGTAGCCA TATTAAAAAG GTAAAAAGAA ACAAGTGAAA TTAATTTTAA 6150 

TAATTTTATT TAGTTCAATA GATCCAAAAT GTTTTCTCAG CATGTAATCA 6200 

ATATAAAAAT ATTAATGAGG TATTTATTAT TCCTTTTCTC AAACCAAGTC 6250 

TATTCTATAA TCTGGCGTGT ATTATTTACA GCACTTCTCA GACTATATTT 6300 

CTTTCTTTCT TTTTTTTTTC CGAGACAATT TTGCTCTTGT CACCCAAGCT 6350 

AGAGTACAAT GGCGTTACCT CGGCTCACTG CAACCTCCGC CTCCCGGGTT 6400 

CAAGTTATTC TCCTGCCTCA GTCTCCCAAG TAGCTGGGAC TAGAGGCATG 6450 

CACCACCACG CCTGGCTAAT TGTGTATTTT TAGTAGAGAC AGGGTTTCAC 6500 

CATGTTGGCC AGGCTAATCT CAAACTCCTG AGCTCAGGTG ATATGCCCAC 6550 

CTCGGCCTCC CAAAGTGTTG GGATTACAGG CGTGAGCCAC TGCACCCGGC 6600 

CTCAGATTAA CTATATTTCA AGCGTTCAGT AGCCACATGT AGCTAGTGCT 6650 

ATGGTAGTGG ACAGTACAGA TCTGCATTTC AATTAAGACA CGTATACAAG 6700 

CATAGTTCAC TAATGCACGG TAAAAAAAAG TATAGTGCTG AGTCGGTGGT 6750 



AGAAATCCTA AATACTGCAG AGCAAAAGTG GTACGAACAG CAATCTCAGT 6800

GATAATGCAA CCATGCTTGC TTTTCATTGC AATTTGCTTA TTTTCCTTCA 6850

GCAAAGTTCA TCCATTTTTG CCAATTCAAT AAATATTTAC TGATAAAAAC 6900

TTTCAATATT AGATTCTTGC ATCTTCATAG ACAGAGTTGC TTTTCACATT 6950

TAGAAAATTA CTTATCAATG TTAAACACAC GTTTTGATAA CCAGTGTTGG 7000

AAAGAGGTGC AGACTCCCCA TGTGCCTATT GATGGCAGAA ATATTCACAG 7050

CCAAAGGGAA ACAAAGGGCT GGGGACAATC ACACACCTCA TGTCTCCTAA 7100

CTCCTGGGAA GTGCTGTCCC TCTGATTGAG CTCTTATTAT TGCCTTCCCC 7150

ACTAACCCTG TCCACTGTGC CCTGGAGCCC TTTGCAGGGT TACCTGCTCT 7200

GTCCTCCTCA CAGAATATCT CCTCTACCTC CTTGTCCAAG CTACAACTTG 7250

GCTATTCTCT GATGACACTG TCTTCCCTGT AGCCCTTTTG AGTAATGGCT 7300

GCATATTCTC CCATAGTCCA GTTCTTTTCC TGTTCTCCAG TCTGGCTTCT 7350

GGATGACAGC CCACTAGTTT GAACTCCATA CTGCTATAGT TCAAGTCCCT 7400

TTTGACTTGT TACCTTGGGC AAATTACCTC CTTTTGTTCA GGTTCCTTGT 7450

TTGTAAAATG ACGATAATAA TGCCATTTGC TTCAGTGGGT TATTTTGAAA 7500

TTGAGTGAAA GAAGGCGGGT AGCTTCCCTA CACGCTCAGT GTAGACTAGC 7550

CTGATGTGCA TTACGGGTGA TGCCATGACT CAGTGTGTTT TCCTCATCTC 7600

CACATCTGGC TCTCATCCAG TGCTCCTGCT TACGGCACTC TGTCCCCCTC 7650

TTACTTACTC CCCCTTATTA ACTGAAGACT GGCACTGATC TCACAGTTTC 7700

CTCTCCACTT CCTAGTCTCA CCATCATCCT AGATGACTTC AAGTCACCTA 7750

GATAAACTGT CTCAGTTTCT TCACTCACAT TTTTTTATAA CAGATAATGT 7800

TACACTCAAG TTGTAACAGA ACCAGCTTAT CCAGCTCATG AAATGTATGC 7850

ATTTCATCTC AACTCTGTAT TCAGTGACAT CCTGTGGGTA TCTGGAAATC 7900

AGCCATGGTG AGAATATTTA CCATGGAAAT TGGCAAATAC TAAAAAGCAG 7950

AGCACCTTTT TTTCTGAGAG CCAGACCATA GCTCTTCTAC TCCATAGCAC 8000

CCATCATAAC AATTTTTAAA TACCTCCACT GAACAGCTTC TTCCTCTCTC 8050

TACTTCTTCC ATATCTGATT TGAGCTTCTT AATTTATCAT GTGAACCACT 8100

CTTGTAATAA TAACCCCAAA TCCCTGTTCC ATTGTTCTTC CTGCTAAAAT 8150

ACTAAACCTG GTTTAGTCCA ACCATATTTT CTCTCTTTGG AATCTACAGG 8200

GTGGCCCAAA AACCTGGAAA TGGAAAAATA TTACTTATTA ATTTTAATGT 8250

ATATTAATAA GCCATTTTAA TGCTTCATTT CCAGTCTCAG TGGCCACCCT 8300

GTATAGCTGG GCTATTGAGC TCTTGCGGGA GGAGGGAGTG GACAGTCTCC 8350

CAGCCACACA GACTGATGTT GCACCAAACA TTTTTTAGCT TCCAGACTTC 8400

CCTGGCCCTT AGTGTTACCC TTAACTCTCC ATTTCTCTGC CTTTCACATT 8450

CTCTACTTTT TAAAAATCTC TGACTCCACC TTCACCTTAT CATTCTTAGC 8500

ACATGACCAT ACTTCTGCTT CCCAAAGAAA ATGAGCAATT ACTTCCTTTT 8550

CCTTTTCCTC CTGTCATCAA ATCTGCAGAC ATGTCATGCC TAAGTCCAGC 8600

TTTCCTCCTT TCTCTGATCT CAGTCTGCTT CTTCCATTTC TGCCCTGAAT 8650

CCCGTCCCCT CCCCAACCCC CAAGGACTTC GCTCTATCAG TCACCTCTTC 8700

CCTCTCCTGT ATCTTCAACT CCTCCCATTT TACTGGCTTC TTCCTCAAGC 8750

CTTTCCCCAA GCCTTTCCCA TCTCAATTAC CTCCTCGCAC ATGCCTCTGC 8800

AGAAACCACC CCGTTTCTTC CCTCCCCTCG GCAGCCTGTT CTTCCTGTTC 8850

TGCCCTCATG ATGGCACCAT CATTGTGTCA CTAAAATCAA TCTCTCCGAC 8900

ATCATCAATG GCCTTCCTTT GTTGGGAAAC CTAATAAACA CTTTATCTTA 8950

TTTGGTCTTT GTTATGGGTT GAATGAGGTT ACCCCGAAAT CCATATTAGA 9000

AGTCCTAACC CCCAGTACCT CAGAATGTGA CTTTATTTGG GAATAGGGTC 9050

ATTGCAGACG TTATTAGTTA GGATGAGGTC ATACTGGAAT GTGATGGGCT 9100

GCTTATCTAA TATGACTGAT GTCCTTATAA CAAGGAGAAA TTTGGAGACA 9150

GACACGCACA TAGGGAGAAT ACCATGTGAT GACAGGAGTT ATGGAGTTGG 9200

AGTCAAAAAG CTATGGGAAC TTAGGAGAAA GACCTGGAAC AAATCCTTTC 9250

CTGCGCCTAG AGAGGGAGTA TGGCCCTGCC ACTACCTTGA ATTCAACGTT 9300

TCGGCTTTTC AAAACTGTAA GACAATACAT TTCTGTTGTT CAAACCAATT 9350

AGTTTGCAGT ACTCTGCGAC TGCAGCCCTA ACAAACTAAT ACAGTCTCTT 9400

GGAGGCATTT GGCAAGGTTG ACAATGGAAG CACTTTCTTA CCCCTTTAGG 9450

TCTGTCGCCT TTCTTGTTGG GGGGTGTTTT CTAACAATTC CTCTCCATCT 9500

CTCTCTCTCT AGTTTGTCTT AAACATTGGT GTTCTTCAGA CTTCTGACCT 9550

AGGCCTTCTT TTCACTTCAC ATATTCCCCT GGGTGGTCTC ACCCACTTCC 9600

AGAAATTACT TAAATTACTG CTCATGCAGT ACTGTGCTGG AAACTGTTTA 9650

ACAACTGGCT CTCTGGGAAG AGGGGAGACT GGTTGATGGT TTTTGCTGAT 9700

TTCTGTGGTG TAAATACTCC CTCCATGGCC AATTCCAAAC TGCCAACAGT 9750

TTAACAACTG GCTCACAAAT TTTCTCCAAA TTTAACATTT GGCTTTCACA 9800

GGCCAACAAC GTGGTACAGC CAACTCCAGC ACACCTCTGC TTTTGTGTCA 9850

GAGAGAAGTA ACTTATTTTT GTACAAAAGG TAAAATAAAA ACACCTGCAG 9900

GCCCCCTTTT TTTCCTTAAC AAACTGCTCT AGAAATAGAA TAGCTGAAGC 9950

TTCTTTTATG CATTCATCTG TTATTTCCAT GTCACTGTGG TGGTGGGATT 10000

ATTTTTCCTT rA fTTTTCTT GTATATGGTT GAAATACTGT ACCTTTGATC 10050

AGTTTTAGTT TTATGGCATG TTTTGCACCC ATATTAAATC TAGTTTTTGT 10100

CAGAGGGCGT CAATATTATT TTCTCAAAAC AAGAAAATAT TTCATTGCAA 10150

AGGAGACAAA CAAAAAGGTC CTTAATACCA AAACTTTGAA ATGTGATTTC 10200

TTGTACTTGG CAGTGTCCAA GTGGTAAACC CAAACAGTAT TGGGTTTTCA 10250

TTTTGTTCAG GAAAGTCTTT GTCTGGCAGC GACTTACCCT TACATCAGGC 10300

GGGCCTTGCT CATTCATTCA CTTAAGTATT TATTAAACAC CAGCGGTGTG 10350

CCAAGTACTT ATCTAGGTAT CGGGTAGATT CTGATAAGTC AGTCAGGTCC 10400

CTGCTCTCAG GGAGCTTGCA GCAGAGATGG GGGCTGCAAT AGAGAGTAAG 10450

CCAAGGAAAT GAAAAAGGAA GTTGATTTCA GAGAGTGATG AATGCTATGA 10500

AGAAAATGAA GGCAGCGCAG TGTGATGGAG AGTGACCCAA GGTGGTACAG 10550

TTTGTACCTC TAAGGACCAG ACTGTGACCC AGGTCACTCA CAGATGCCCG 10600

TCATGTGATG CCACAGCAAC TTTTCCAGGT GCTCGTTTCC TCCCACTTCC 10650

CAGTCTCTTG CCCAGCCGCG ACTGCTTACA AATACAGCTA GAGGAATCTA 10700

AATGAGGTTC CTCTATCATC AAACCCAATC AAAATGCCAA GGAACAGAAT 10750



CAGTGCCTGG CTGAAGGCAG TGGAACAGGG CCAGCCTGGA GTGGTTCTCT 10800

CTGAGGAAGT TCCTCATCTT GGTTTTAGGG CCATACCTTG TGACCTGTGA 10850

GCTAGGGGTT GCCAGTCCCT GACATTTCTA CTGAGGACTC GCCTGTCTAT 10900

ATTCCCGGCC TGTATGTGTC TCCTGAGTTC CAGACACACA GGGCGAAGCG 10950

CCTGATGGAT GGAAGTATGT TTTTTGGTGT TCCATTGGTA TCTCAAATTC 11000

TACAAAACTT AGTGCCCCTT CTCCTCCCTG TTCCTCCCCA TCTTCAGTCT 11050

ATCACCTGTT CCTCATCCAG CAAATGATAT TACCATCTTC CAAGGAGCTT 11100

CCCAGGAGTA ATCCTTGACT CCTCCTCAAC ATCCAATTAA TAATCAAATC 11150

TAGGCCAGGT ACAATAGCTC ACGCCTATAA TCCCAGCACT TTGGGAGGCT 11200

GAGGCAGGTG GATCATTTGA GGCCAGGAGT TCAAGACCAG CCTGGCCAAC 11250

AAGGTGAAAC CTGTCTCATT TAAAAAAAGT TATTTTAAAA ACTCAAATCT 11300

ATTATTTCTA CCTCTAAGTG TGTCTTGAAT TTATCCATCT CTCTCCATCT 11350

CTGAGCTGTT ACCTTACCTC AGTCCATCAC GTTTTGTCTA CGTTAACATG 11400

ACCAGAGTCT TGTTCTTAGT CTGGTGAGGT CACTCCAGCT GCTTCAGATC 11450

CTTCCATGGC TCACCGTTGC CCTCATATAA AGTTGGCACT CCTGGACATG 11500

TGGCTTACGG GGCCCTCCGT GATGTGGCCC TATTTGCTTC TCCATTCTGT 11550

TCTCTCCCAG CCTCTCTGCC CCCATCTCTA GGCACCAACC ACACCCTTCT 11600

GCTCGTCAAT GGTGCCAGCT TCTCTTCTAT CTCTGGTCTT TGGACAGACT 11650

TTTCCCTTCA CCTGGAATGC TTTCTTCAAT CCTACCCCAC TCTCTTTAAT 11700

CTAGATAAGG TTTATTCTTT TTGAATGTCT AGCAGTGAAA CCATTTCCCC 11750

TGAAAAACCT TCTCTAACCA ACCCCCTACC CTCAGCCCAA GGTCTAGATT 11800

AGGAGTCCCT CTGAATGTTT CCATAGCATT TTTAAAGAAT TGCCTATTTA 11850

CTTGTTCGTA TCTATCACTA AACTACAAAT TGTATGAGAA CAGCCACTAT 11900

CTCTGCCTGG TTCACCATTC ATCTCCAGCA ACTAGCATAA TGCCTGGCAG 11950

AGTCAGCCTG CAACAAATAT TTGTTGAATA AATTAACAGA TGGCTTTATC 12000

TCCTTAAGTA AATCTTGCTT TTTTCACCTA TTAAAACAGA CGCACAGGCC 12050

AGGTGTGGTG GCCCATGCCT GTAATCCCAG CACTTTGGCA GGCTGAGGTG 12100

GGCGGATCAC CTGAGGTCAG GAGTTCAAGA CCAGCCTGGC CAACATGGTG 12150

AAACCCCATC TCTAATAAAA ATACAAAAAT TAGCTGGGCA TGGTGGTGGG 12200

TGCGTATAGT CCCAGCTACT AGGGAGGCTG AGGCAAGAGA ATCGCTTGAA 12250

CCCAGGAGGC AGAGGTGGCA GTGAGCCGAG ATCATGCCAC TGTACTCCAG 12300

CCTGGATGAC AGAGACCCTG TCTCAAAACA CACACACACA CACACACACA 12350

CACACACACA CACACACACA CACACACACC AAGTTGTATA ATTTAAAATA 12400

TAACGTGCTT GTTATGGAAC ACTTGTAAAA TACAGGAAAG TAATGAAAAA 12450

GTCTACCATC TAGCTCACCA CATAATGACC ATTGCTATCA TCCTGGCATA 12500

ATTCTCTCCT GTATATAAAT ATATATTCTT TTATTGTTAA AATTACACTA 12550

TGAGTACTAT TTATTTATTT TACTGTGGCA AAATGCGCAA AACATAAAAT 12600

CTTGCCATTT TAAGGTATGC AGTTTGGTGC ATTCACCACA CTCACATTGT 12650

TGTGCAAATA TCACCACTAT CTATCTCAGA ACTTCTTCGT CTTCCCAAAC 12700

TGAAACTCTG TACCCATTAA ACAATAGTGC ATCCTCTGTT TTCCCCTCCC 12750

TACAATTTAT TTTTATTTGG GTTTGTACCA AACTGAAAAT AGCTGCTTCT 12800

TCCTTACTTA GTTCAGATTA GCATTTCCAT TTATTTAGCC GTGGTTTTGA 12850

GGATGCCATG ACAGATGCCA TCCTTCCTAG AGCTCTTTGG GGCTGTCAGG 12900

TATTTCAGTC AGGGTGAATT CGGGTTGATA ACATTTTAAA ATCTCACTTT 12950

ATTCTGAGGT TCCTAGTGTC AGAGCCCACC GTATTTTTAG GGACTCCCAA 13000

GTTACAAACA AAAATATGGT GAGGAGGAAT CACTGAAGTT TTAACACAAG 13050

AGACTTACAT TTTGTTCAAT TTCTATCTTT TAGTTTATTT CCTAAGCATA 13100

AAGAAATACT TTGAAAATTT TACATAGCAT TATACATATT TAATTAAGCA 13150

TGAGCACATC TTAAAACTTT AAATTTTAGA TCAGATCTTT AATTCCTAGG 13200

ATATTAAGAG GTACTGGCAA TTTGGCCAGG TGTGGTGGTT CACGCCTATA 13250

ATCCCAACAC TTTGGGAGGG TGAAGTGGGC GAATTGCTAG AGCCCAGGAG 13300

GTGGAGGCTG CAATGGCCTG AGATCACGCC ATCGTACTCC AGCCTGGATG 13350

ATGAGAATGA AATCCTGTCT CAAAAAAAAA AAAAAAAAAA AAAAGAAGAA 13400

GAAGAAGTAT TGGCAATCAG TGCTCCAGGA ATAATTTCCT GACTTGAAAT 13450

AAACCTACAT GTAGACAAAC TAATTAGGCC ATTCCAAGAG TTGCTAGCAT 13500

TGGTTTAATA TGTTTTCAGA GCATTCCAGG AAGCAGTGTG GCCAGCATTG 13550

CATGTTTGAT ACTTCAGAAA TGTATGACAG GTGTTTCTCT TACCCAGGTC 13600

TTCTGTTTTC TTAGTTTTGC TCATGTAAAT ATTTATGAAC ATCCTCATCT 13650

TTTTGAGGGA AGGGATTATA GATCATTCTA ATTCCATTTT CTAGCATTTG 13700

GTACCATTCT AAGCACATGA TAGGCACCCA TTTGGAGCAT TTTTGGCTTG 13750

ACAGAATATG CATTTAGAAT TGTTCAAATT AGAGGTGTCA GTGATGGGAA 13800

TTAGAATACT ATATAATTCT AAGTCATTTG ACTTAAATAC AAAAGAATGA 13850

TTTTCCTTGG TGGGGAATGG TGAAGGGAGG CAGGAGTTAA GAAGAGGAGA 13900

AGAGATCCTA AGTCATTTAT AAACTTCTCT GGAAAGACAG GTGTGTGAAG 13950

ACTTTTTAAA AAGTCATTCA CCAAATTGTG TGTGTGTGTG TGTGTGTGTT 14000

TTAAATAGAC TTTATTTTTT AGAGCAGTTT TAGGTTCACA GCAAAATTGA 14050

ATGCAAGGAC AGAGATTTCC CATAAACCCC CTGCCCACAC ACATGCATAG 14100

CCTCCCTCAT TATCAACATC CCCACCAGAG AGGTGTTTGT TCTAGTTGAT 14150

GAACCTACAC TGACACATCA TTATCACCCA AAGTCCATAG TTCACGGCAG 14200

GGTTCACTGT CGGTGTACAT TCTATGGGTT TGAGCAAATG TATAATGACA 14250

TGTATCCACC ATTATAGTAA CATACAGAGT ATTTTCAGTG CCCTGCAAAT 14300

CCCCTGTTCT CCACCTATTC ATCCCTCCCT CTCTGCATTT CCACCCCCAG 14350

CCCCTGGTAA CCGCTGATCT TTTTACTGTC CCATAGTTTC GGACGATCTA 14400

TTTTTCAGAC AGACACAGAG CTGTCTTTCC CTTAGTTTCT ATTCTATCAT 14450

TTCTTTCTCC CCATCCATCA TAAAAGGCTA TGAGTTTTTT TTAAGTGTTG 14500

AACACCATCC TACTTGTCAA GTTAAAACAT AAGCTCCTGG CTGGGTACAG 14550

TGGCTCATGC CTGTAATCTC AGCATTTTGG GAGGCTGTGG CAGAAGCATC 14600

ACTTGAAGCC AGAAGTTTGA GACCAGCCTG GGCAACATAG CAAGACCCCA 14650

TCCCTCCACA CACAAACACA CACACACACA CACACACACA CACACACACA 14700

CACACACACA CACAAAAACA AGCTCTTGCC AGAATTAGAG CTACAAATTG 14750
CCCTCAGGTT CCTAGAAGAT CAGTCCTTCA ATTAGATTCA GATTGAGATG 14800

CTTCCTCTTT TAAACAATGA TTCCCTTTCT ATCATGCCCA ATAAGAAAAC 14850 

AAATAAAAAT TAAACAATAC TGCCTGTAAT CTCAGCTACC CAGGAGGCAG 14900 

AAGCAGAACT GCTTCAACCC GGCAAGCAGA AGTTGCAGTG AAGTGAGATC 14950 

GCGCCACTGC ACTCCAGCCT GGGAAACAGA GCAAGATTCT GTCTCAAAAA 15000 

CAAAACAATG TGATTTCCTC CTCTAAGTCC TGCACAGGGA AATGTTAAGA 15050 

AATAGGTCCA CCAGGAAAGA AGGAAGTAAG AATGTTTGAC TAGATTGTCT 15100 

TGGAAAAAAT AGTTATACTT TCTTGCTTGT CTTCCTAACA GTTCTCCAAA 15150 

GCTTCGTACC TTGGCCAGAG GCTTGTCTCC TGCGTACCTG AGGTTTGGTG 15200 

GCACCAAGAC AGACTTCCTA ATTTTCGATC CCAAGAAGGA ATCAACCTTT 15250 

GAAGAGAGAA GTTACTGGCA ATCTCAAGTC AACCAGGGTG AAAATTTTTA 15300 

AAGATTCACT CTATATTTTA ATTAACGTCA GTCCGTCATG AGAATGCTTT 15350 

GAGAAAACTG TTATTTCTCA CACCTAACAA TTAATGAGAT TAACTTCCTC 15400 

TCCCCTCATC TGACCTGTGG AGGAATCTGA ACAAGAGGAG GAGGCAGTGG 15450 

GCAGGTTTCC TTATCATGAT GTTTGTCATG TTCAGTGTGA GGCCTCACAA 15500

AAAAAAAAAA AAAAAAAAAA GGCGTCCTGG ATATAACTGA GAGCTCATTG 15550 

TACAGTAAAT ATTAATAAAA CAGTGATTGT AGCTGAAGGA TAGAACTGCT 15600 

TGGAGGGAGC AAGTGGGTAG AATCGCGTCA AACTAAAGAG CATTTCTAGC 15650 

CAAAGACACA ATGATAGATT GAAGGATATT TATTCTAAAT ATAGAATATG 15700

GGTGAACGAG ATCTGTGGAC TTCTGGGCTC CAACGTTAGA TTCTGATTTT 15750 

AGCAAGCTTG TCAGGGGATT CTGATATTGA AAGGCTGTGG CCTTCACCTG 15800 

AGAAACCTGC CCTAGGGGGC CATGAAAATT TGTCCTGTCT TTCAGAAGTG 15850 

CTATCAGACA TCAAATGGAA GTTAAATCGT ATCTTAACAA TTACTAGGAT 15900 

GGGCGCAGTG ACTCACACCT GTAATCCCAA CACTTTGGGA GGCTGAGGCA 15950 

GGAGGATCAC TTGAGCCCAG GAGTTCGGGA CCAGCCTGGG CAACATAGAG 16000

AGACGTTGTC TCTATTTTTT AATAATTTAA AGAGAAAAAA ATACTGAAAA 16050 

TATTGTATAC ACCACTGAAT TATAATAATG TGTATATAAT GTATATATTC 16100 

ATTATGAGGA ATATTTGATT ATTTCATATA TTATATCTTT TCCTTCTGTT 16150 

TATTTTATCC AGTTATGAAG TATTTAGAAC AATTCATCAG TAATTGGGGC 16200 

TAAATTGACA GAATAGTAAT CAGAGAAAAT AGAAAAAGAC AGATGGGTTA 16250 

TCTTTGAATA CCAGGTTGGA GTTGTTTATG GGTTTGTTTT TTGTTTTGGG 16300 

GGCGTTTTTT TAGACAGAGT CCCACTCTGT TGCCCAGGCT GGAGTGCAGT 16350 

GGCACAAGCA TGGCCCACTG CATCCTTGAC CTCTTGGGCT CAAGCAATCT 16400 

TCCCACCTTA GCCTCCTGAG TAGCTGGGAC CACAGGTGCA TGTCACCACA 16450 

CCCAGCTAAT TTTTTTATTT TTTGTAGAGA CAGTCTTTCT ATGTTATCCA 16500 

GGCTGATCTC AAACTCCTGC ACTCAAGTGA TCCCCCTGCC TTGGCGTCCC 16550 

AAAGTATTGG GATTATAGGC ATAGCCACCA CACCCAACCT AGTTTCTATT 16600

TAGACTTGGC CCTTTCCCAC CAGTCATTTG TGTCCAAAAG ATCTCATAAA 16650 

TGTAGACAGG AAACTGTCCT TTGCTCATCA GTTTTCTTCA TCCTGTGTCT 16700 

AGGGGGATGG TCGGTGGGGG AAACTGGGGT TATGCAAGTT CCTCTGAAAC 16750 

ATCCTCTGTG AGCCCAGGGA TGGATGAGGC ACCAGCCGCC AGCGAGTCAG 16800 

TGTGCAGCTT TCCAGAAAGG AAGTCATCAG CCAGTCAGCC GGCCCTGGCA 16850 

GCCAGCACCC GGCAACCCTG CTGTCTTGTG ATAAAGAAAT GGTCTGCCTG 16900 

ACAGGATGGT GTGGATTTTT CTTTTTTCTT TTTTTTTTTT TTGAGACAGG 16950 

GTCTGGCTCT GTCGCCCAGG CTGGAGTGCA ATGGCGGGAT CTTGGCTCAC 17000 

TGCAGCCTCT GCCTCCCAGG CTCAAGGCAT CCTCCCACCT CGGTCTCCCG 17050 

AGTAGCTGGG ACCACAGGCA CACACCACCA CGCCCAACTA AGTTTTCGTA 17100 

TTTTTAGTAG AGGCAGGGTT TTACTATGTT GTCCAGGCTA GTCTCAAACT 17150 

CCTGAGCTCA AGCTATCCAT CTGCCTTGGC CTCCCAAAGA GCTGGAATTA 17200 

CAAGCGTGAG CCACTGTGCC TGACCAGGGT GGATTTTTTC AAGTGCACAT 17250 

GTTGTGGTCC CAGAAGCTCT GATGGTACCA AATTCCAAGC GAAAAAAAGT 17300 

CAATGGTTCC CACCCATCCT ACCTCCCATG ATGGCAAGAG GAAATCACCA 17350 

CACTGCAGAT ACAGTCCATG TAAAACAAAT TGCTATGGAT TTTGAAAGTG 17400 

AACCTTAAGA GAACTGCACT ATGTTTTCTT CATTAGAGTT CTCTGGTAAT 17450 

TTCCAGCTTT TTTTTTTTTT TTTTTTAGAC AGTGTCTCGC TTTGTCGCCC 17500 

AGTGTCACCC AGGCTGGAGT GCAGTGACGT GATCTCGGCT CACTGCAACC 17550 

TCCGCCTCGT GGGTTGAAGT GATTCTCCTG CCTCAGCCTC CTGAGTAGCT 17600 

GTATTTTAGT AGAGACGAGG TTTCACCATT TGGCCAGGCT GGTCTCGAAC 17650 

TCCTGACCTC AAGTGATTCG CCCATCTCAG CCTCCCAAAG TGCTGGGATT 17700 

ACAGGTGTGA GCCACTGCAC CCGGCCAGTA ATTTCAAGCT TCTGAGGAGC 17750

CCTTTGAATT GTTAAATAAC TTGTAGCTAT GTCCAACATA TCCATGTTCA 17800 

GTGTATGTTC GATATTTCTT AGGAAACCTG CCCTTGGTTG TTTTCTTTGT 17850 

GGTAATTCAT GAGCCGGCAA ATTTGACATG TGTTACAGAA TATACCTTTT 17900 

CTCTGCTCTC CTACCTCATA ACCAGAACTT AATTATCCTG CTTTAGTCAC 17950

ATAAATAGCT AACTAAATAA ATATATGAGA TTTCAGTCTG CTCACTGTGA 18000 

AAATAGACCT TCTAAATGAT CTCTTCCACT TGCAGATATT TGCAAATATG 18050 

GATCCATCCC TCCTGATGTG GAGGAGAAGT TACGGTTGGA ATGGCCCTAC 18100 

CAGGAGCAAT TGCTACTCCG AGAACACTAC CAGAAAAAGT TCAAGAACAG 18150

CACCTACTCA AGTAAGAAAT GAAAGGCACC CTAGAGATGT TCCAGCCCCA 18200 

AAGATATTTG AATAGGTTGG ACTCGGGCAC CAATCTAGCA AGTCCTACGG 18250 

AAGTTGTATA AAGCTGAAAA TACTGAAGCA TTTCCCAAAT GGGAAATCCT 18300 

AAACTCAAAA CTTGCTTTTT GGTTTTTTTG TTTGTTTGTT TTTTCTTCAT 18350 

CTGACATTGC TTAGTAGTCA CAGAATGAAA GATAAATCAA TCATTCATGA 18400 

TCTAACAATG ACCTTCAGTG CTCTAAAAAA CTACGGAGTC AAGGAAAACA 18450 

TGAATATATT CCTCATGTAA AATTAAAATA CAGACATATA AAGGGCAAAA 18500 

CATGAACATC ATTCATACCT TGAGGTCCGT CCCCCTCCCA GAAATAACCC 18550 

CCAGTATGCC TTGGTTTAGA GCATTAAGCA GGAGGGCCCT GAGTCACTCC 18600

AGACAGTCTT GACCACCAAG CAGCATTCTC TTTTTGTTTC CTCTGTGGCT 18650

TTTGCAAACA CAGGGCTAGC TCAGCTACCC ATTAGTATGT TTTCAGTCAC 18700 

TAAAACAGTC TTCCAGTCTT CAAATTAGGA TGACATTGTC ACATGGGGCT 18750 
TTAAAGCAAG TGAAACAAGG AACCCCCTTT TTTTTTTTTT TTGAGATGGA 18800

ATCTCACTCT TGTCGCCCAG CCTGGAGTGC AATGGCGCAA TCTTGGCTCA 18850 

CTGCAACCTC CACCTCCCAG GTTCAAGAGA TTCTCCTGCC TTAGCCTCCT 18900 

ATTCATTATG AGGAATATTT GATTATTCAG TTCCTGTAGG GTAAAGATAT 18950

TACCCCCGAT CATATTATTG ATTATTGAGT AGCTGAGATT ACAGGTGCCT 19000 

GCCACCACGA CCGGCTAATT TTTTGTATTT TTTAGTAGAG ACAGGGTTTC 19050 

ACCATGTTGG CCAGGCTCCA GGCTCGTCTC GAACTCCTGA CCTCAGGTGA 19100 

TCCACCCACC TCAGCCTCCC AAAGTTCTGG GATTACAGGC GTGAGCCACC 19150 

ACTCCTGGCC ACAATCCTTT TTTAACTATG AAATATATTT TTATCTGAAG 19200 

TTTGATGTTT ATACCCAACT GAGGGATGAT GTTCCCATAT CTCAGTTAAA 19250 

GAAATAACCT GCTCAGATAC TTCAAGCTCT TCTTTTGACT TTTGAAAATA 19300 

AATGATCTTG AAGTTACTAT ACTTTGTTTG GGTTAGTTAA CATTATTTAA 19350 

AGTATATTAT TTTAATTAAT TATCTTTGTA AGATTTTACT GTATACTACC 19400 

TGGAGTTCAA TGTATCAGAT GGATTTCAAA TTTATGTACA TTTTTTATGT 19450 

ATATGGTACA GAAAAAAATG TGATCCATAA GAAATCAGAA AATAGCGCAT 19500 

ATGCTAATAG CTAATGTTGT CCTCTAAAAA ACTTATTTTT GCATTTTTAA 19550 

GAGGGGGATA TACTCTGACA CTTTAATAAG TGTAATTAAT TATTGACTGG 19600 

AATTTGGCAT GAGGCAGGGC CATTTCAGAT CCCATTAAAG GAATGACACA 19650 

TACCAGAGAA CCACAGAAGT AAGGCCACAT TTGTAATAAA TCATTATAGC 19700 

TCTGCTAGGA GAAGACCCAG TTGTATTAGG TAATTAATGG ATTTGCTCTT 19750 

AAAACACATG TCCCGGAAGA TATAGGTGAG TCTTGGGGGG CCGCATTAAA 19800 

CATTATACCA ATGTATCTTA CATTTCTAAG AAAGTTTTAC TACTTTACAG 19850 

GATCTTTCTG TTACCAAAAT GGAAGGTTTC CAACTCCAGG ACTTGGCTTT 19900 

CATAGTTCCT ACACCAGGGG AAATGCCTTC CTTTGCTAAC TATGCAACCA 19950 

GGTTAGTTAG TGTAAGTCCA GCCACCCTGT TGGCAATGCT AAAAGGTACA 20000 

ACAAACACAG AATTTTATTT GCATTTGTAA ACATTTGATT TCTGGCTCGA 20050 

AATTTTCAGT TTTCATGGGC ACGTCATGGA AACAGAAATC TTCTGTGTTT 20100 

AGTTTGGGCA CCTACTCATT GTAGTGACAA ATATTTCAGA AGCCAATAGG 20150 

GGATTCCACA AATTGTTCTG AACCTGTGGC TGAGACTGGT AATGGCTGAG 20200 

TGACATGGGG ACATACCACA AAAGAAGAGG TAGCAAAAGG CTGCTGAGAT 20250 

AAGGACATGT TCATTGCTTA GCTAGTGGCC TGCACCCTTA AAACACATGT 20300 

CCCAGGCTGG GTGCTGTGGC TCACGCCTGT AATCCCAGCA CTTTGGGAGG 20350 

CTGAGGCGGG TGGATTACCT GAGGTCAGGA GTTCGAGACC AACCTGGCCA 20400 

ACATAGTGAA ACCTCATTTC TACTAAAAAT ACAAAAATTA GCCAGGCATG 20450 

GTGGCGGGCG CCTGTAGTCC CAGCTACTCA GGAGGCAGGC AGGAGAATTA 20500 

CTTGAATCTG GGAGGCAGAG GTTGTGGTGA GCCGAGATTG CGCCACCGCA 20550

CGCTAGCCTG GGCGACAAAG TGAGACTCTG TCTCAAAAAA ACAAAAACAA 20600 

AAAACAAACA AACAAAAAAC AACAACAACA AAAAAACGGG TATCCCAGAA 20650

GATACAGGTA AGTTTTCTAA CACAGGTCCT CTTGTATGGT GCGTTCCACT 20700 

TAAGTAGAAG ATGACAAAAA CATTTGTCAT GAGAATATAG ACTCACATTT 20750 

TAAACCTGTT TGAGCAGGAA AAGGAAGCAA TGTTACAGAT GTAATTCTGG 20800 

GTGTGACTGC AGAAAGGATG ACTCCCTTAT TAAAGTAGTC ATCCTGAGTG 20850 

AGCTAACTCT TTGTACTTCC TCTTCTCCTC CTGTTCCCCT CATCACCCCA 20900 

TTCTTCCGTT GCCTACACCC AGGCCCACAT TGGATGCTGA CATAGACTTA 20950 

CATGGTACAG TCCAAGGGAA AGATCTGCCA TTTTTTTCAA TGTGTCATCT 21000 

TGGTTATCTT CATTCCAAGG ATCTCTCCAC TCTTTATACA GTAAGAGATG 21050

AGAGTCTGGA AAGGATTGGG AATAAGATAA TGAATTGTAA GTTTTAAATT 21100 

GTTCTTCGTA TTTTGGGGAA GGAGTAGGCT AGGTGGTCCT TCTGTTTTTT 21150 

TTTTGTTTTT TTTTTTAAAG TAGATGTGGC CAGACGTGGT GGCTCACGCC 21200 

TGTAATCCCA GCACTTTGAG AGGCTGAGGC AGGTGGATCA CTTGATGTCA 21250 

GGAGTTCAAG ACCAGCCTGG CCAACACAGT GAAACCCCGT CTTTACTAAA 21300 

AATACAAAAA CTAGCCGGGC TTGGTGGCGT CCACCTGTAG TCCCAGCTAC 21350 

TGCAGAGGTG GAGGCAGGAG AATCACTTGA ACCCGGGAGG TGGAGGTTGC 21400 

AGTGAGCCAA GATCATGCCA TTGTACTCCA GCCTGGGCGA CAGAACAATA 21450 

CTCTGTCTCA AAAAAAAAGA GAAAAGAAAA GAAAAAAAGA ATGGATTTGA 21500 

ACTCAGTCGT CAATAGCCTC TATTCCAGGA GATGTTACAG TTGATTATGT 21550 

TATAGGGGGT GTATAATAGA ATTTCGAGCT ATGTAAATTC CAAGTGCATT 21600 

TGGAAGAATG AAGAAATGGA GGAAGGGTAA AGTATGAGTG CAAGCATTCC 21650 

AGGTTTTTTG AAAATGCTAT AATCTTTGTT CAGGGCTAGT ACAAAGTGCT 21700 

ATTTAGCTGT AAGGGTTTTT TGTGATTTAC AGACAGTTTT CACATGTGTC 21750 

ATTTCAACCT TGGTTTTATG GCGAAGGCAT GTGATGGTGC TTGTCCCAGG 21800 

ACTTTAGATC CATATCTGAG GTTCCTGTCG GGCAAAGATA TTACCCCTGA 21850 

TCATATTATA GTCTATAAGT GGGAGAGTTG TGCCTGGAGC TCAAGTCTTA 21900 

TGATTTCTGA TCCAGGGCAC TTCCTACAAC ATGATTTTGC AATATAAAAG 21950 

CCTATAATGT GTGACTAAAG CAGGTCACTC ACCCCTTGTA ACAGACTCTA 22000 

GTAATGGTAC TGCCACCAAA CGGCTGCGTG ATATTGGGCA AAGACTTACC 22050 

TTATTTGAAT CTCAGTTTCC TCCTAGAAAA ATGAGGGTGG AGGTTAAGCA 22100 

TAGGCTGATG ATCCTAAAGC CTCCATACTG CCCTAAACTG TGGCTCTAAG 22150 

ATCCAGTAGA ATGCTGGGTC ACAGGACTCT AGGGAGCTTT TCAAACCCAA 22200 

ATGTCTGTCA TTCCTTGATG GTAGGCAGCA GTTTATGGAA GTGGGCGACA 22250 

CAGCAAATAT CAAAATACCT AAAGCAGCTT GCAAGAGTTG TTTCTGCCTA 22300 

GTGGTCTTTA TAGTTAATAT TAAATAGTTA ATTTTTTTTT TTTTTGAGAC 22350 

AGAGTCTTGC TCTGTTACCC AGGCTGCAGT GCAGTGGCAC AATCTCGGCT 22400 

CACTGCAACC TCCACCTCCC GGGTTTGAGC AATTCTGTCT CAGCCTCCCA 22450 

AGTAGCTGGG ACTACAGGTG CATGCCACTG CACCCAGCTA ATTTTTGTAT 22500 

TTTTAGTAGA GACGGGGTTT CACCATATTG GGCAGGCTGG TCTCGAACTC 22550 

TTGACCTCAG GTGATCCACC TGCCTCAGCC TCCCAAAGTG CTGGGATTAC 22600 

AGGCATGAGC CACTGCACCC AGCTTAAATA GCTAATATTT AATATTATTC 22650 

TATAGTTATT CAAGTAATTC AGGCCAAAGA CTTAGAAACA AAACAAAAAG 22700 

CCACTTTTAA GGAGAAAGGG TGTAAGTTTG CCAGATAGAT AGAGATCTTT 22750 



CTTTTTTAAC TACAAGAGTT CAGGAATGAA 
ATAGATATAC ATGAAAATTG GAAGGACTTA
TTAAAGACAA CACTTAAAAT TATATTGTTG 
TAGAACAGCT AATGGTTTAA AAAGCAGAGT 
GGCACCTTAA TATCGCAGAA AACTTTTTAA 
AATACCTGTA TTTTGATTAT TAAATTGTAA 
AATAGGTCCA ATAGTAATGC TGTCCAATAG 
GCAAACTTAA AAGATCCTAC AGTGCCTCTG 
GAGTTGAATT TCAGATAAAT AATTTTTTTC 
CTTTACTTTT TTTTTTGTTT TTTTGTTTTT 
GGGTCTCATT CTGTTGCCCA GGCTGCTGTG 
ACTGCAGCCT TGACCTCCCT GGGCTCAGGT 
CCAAGTAGCT AGCTGGGACT ACAGGTGCTT 
TTTGTGTTTT TTGTAGAGAT GTGGTTTTGC 
TGAACTCCTG GGCTCAAGTG ATCCGCCCGC 
GGATGACAGG CATGAGCCAC TGCACCTGGC 
AATGGTTACA TAGGACATAC ACTAAACATT 
TCAAGTTTAA CTAGGTGCCC TGCACTTTTA 
TACCCATGCA TTCACTGGTG CTCCCCAGCT 
AACCATAGTC CTATAACTCT AGGCCAATTT 
ATTTTAAATT AATAAATAAT AACAGGAATT 
TATAATTAAA ATTATCAAAA TATTTTTTAA
ATTTAGATTA TGAAGAGTGG GGTTTATGCT 
TGCATGTGGA GCACTGAGCT ATAAATTGTG 
GTCACTTGAA CAAAACCTAA GTGTCAGACC 
TGGGATTTCA TTCAACAGCT GGAGCAAATG 
AATTTGTCCA ATTTTGTTGT CTCAAAAACA 
AACTAGAATT TCTTCAGTTT AACAACAGAA 
GCGAATCTGG AGGCCTTCAT TGTGGTGCCA 
CGTTTTTCTT TTAGGAAGCT CTGTAGATGT 
GCTCAGGACT GGACTTGATC TTTGGCCTAA 
GATTTGCAGT GGAACAGTTC TAATGCTCAG 
TTCCAAGGGG TATAACATTT CTTGGGAACT 
GGGAACAATT CATTAATAAG GAGATTCCCC 
TTTCTTTTTC TTTTCTTTTT TTTTTTTTTT 
TGCCCAGGCT GGAGTGCAGT GGCGCCACCT 
CTCCCAAAAC GCCATTCTCC TGCCTCAGCC 
AGGCACCCGC CACCGCGCCC GGCTAATTTT 
TTTTTTTGCA TTTTTAGTAG AGACGGGGTT 
GTCTTGATCT CCTGACCTCG TGATCTGCCC 
TGGGATTACA GGCGTGAGCC ACCAGGCCCG 
CACTTTTTTT TTTTTTTTGA GACGGAGTCT 
AGTGCAGTGG CGCCATCTCG GCTCACTGCA 
CGCCATTCTC CTGCCTCAGC CTCCCGAGTA 
CCACCACGCC CGGCTAATTT TTTTGTATTT 
CCGTGTTAGC CAGGATGGTC TCTATATCCT 
TCGGCCTCCC AAAGTGGTGG GATTACAGGC 
AACACTCTTT TTATTATTAG CAAATATACT 
CAAGTGCTCA ACAATGCAAC TTTTGGAAGT 
CTGTATTTAT TCCAGAACCT ATTATTGCTA 
TGAAGTGAGA ACCAGTTGGA GCCAGCAACG 
TTGAGATTTT CAGAATCACT TAACCCTATT 
AGCAAAACTG GGAAGTCAGC AGTTTGTTTT 
GTTTCTCAAA TGTGTCAGTT AATCTCAGTA 
ACCTGCCCAA GCGGTCTAGA ACTTGCCAGT 
AGCTCCTGAC TGTCTCCTTC TTCACTCTTT 
TTTTAACTAT AAGTATTCAT GATTCGCCAC 
GCTTTTTCCA CATATCAGCC AATGGAAATA 
ATGTAGTAAT AGGATAAGCA CAAGTCTTCT 
TTTTTTTTTT CAGACAAGAT CTTGCTCTGT 
GGCGTGTTCA TAGCTCAATG TAACCTCCAA 
CTCACACCTC AGCCCCCTGA TTAGCTAGGA 
TTTTTTTCTT TTGTCTGGTT GTGTTGCCCA 
CTCAAGTAAT CCTCCTGCCT CGGCCTTCTA 
TGAGCCACTG TGCCCGGTCT CAAACCTTTT 
ATTAGATATG GAATATAGTC TAGTTCCCAG 
ATTACCCTCA TTATTAACTT CAAATTGTTT 
TTATACAGTT AAAATTTTTG TTTTGTTTTT 
CTATGAGTTT TACTTTACTT ATTTATTTTA 
CTCTGTCACT CAGGCTGGAG TGCGGTTGCG 
CTCGACCTTC TGGGCTCAAG TGATCCTCTC 
ACTACAGGCA TGCACCACCA CATCTAGCTA 
AACAAGGCTT TACTATGTTA CCCAGAGTGG 
GGGATCCTCC TGTCTCAGCC TACCAAAATG 
CATAGCGCCA GACCTGGTTT TACTTTTCTT 
TGTAATTTGG AAAATGTTTT GTTGCTTTTA 
TTTAAATACA ACATTTCTCG ATATATATTT 
AACCTAACAG TTTCCTTAAG AAGGCTGATA 
TTAGGAGAAG ATTTTATTCA ATTGCATAAA 
CAAAAATGCA AAACTCTATG GTCCTGATGT 
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TTACTCTTTA ACAAACGACT 22800 

TTATGCATAT GATAATCAAT 22850 

CCACTCTCAA AAAGTGGTAA 22 900 

ACAGAAGTTC CCAAACTTAT 22950 

AGCATGCCTA GGCCACAAAA 23000 

GGTCTACACA ACCTAATAGT 23050 

ATGTTGATGT TTTTTTCCTT 23100 

TAAATAGCAC TGCCTGGTTA 23150 

ATGTTAATTA TTTTTCTTTT 23200 

TTGTTTTTTT TTTTGAGACA 23250 

CAATGGCATG ATCATGGCTC 23300 

GATCCTCCCA CCTCAGCCTC 23350 

ACCATCATGC CCGGCTAATT 23400 

CATGTTGCCC AGGCTGGTCT 23450 

CTCGGCCTCC CAAAGTGCTA 23500 

CCCTGGGCGA AGTATTTCTT 23550 

ATTTATTGTC TATATGAAGT 23600 

GTTGCTAAAT CCTGTAGCTG 23650 

TGCCTTGCAC AGAGTTTGGA 23700 

TTTAATGTAA AATTTGATTC 23750 

TTTTTAAAAA TTGTTTTAAA 238 00 

CTGAACTTGT GACTAGAGAT 23850 

AACTAATGAC AGTCTGGCTA 23900 

GCTTCCCCAA TTCTCCTGAT 23950 

AGAGCTTCTG GTATCTTCCA 24000 

AAGTCAGATT GATTTTTTTT 24050 

TAATTATAAT CATTTATTAG 24100 

ATAGTTATTC ATTATGAAAA 24150 

ATCTAACCAT TAAATTGTGA 24200 

GCTATACACT TTTGCAAACT 24250 

ATGCGTTATT AAGAACAGCA 24300 

TTGCTCCTGG ACTACTGCTC 24350 

AGGCAATGGT GAGTACCCCA 24400 

ACTAGCATTA TTTCTTTTCT 24450 

GAGACAGAGT CTCGCACTGC 24500 

CGGCTCACTT GAAGCTCTGC 24550 

TCCCGAGTAG CTGGGACTAC 24600 

TTTTTTTTTT TTTTTTTTTT 24 650 

TCACCGTGTT AGCCAGGATG 247 00 

TCCTCGGCCT CCCAAAGTGC 24750 

GCTAGCATTA TTTCTTATGA 24800 

CGCTCTGTCG CCCAGGCTGG 248 50 

AGCTCCACCT CCCAGGTTCA 24900 

GCTGGGACTA CACGCACCCG 24950 

TTAGTAGAGA CGGGGTTTCA 25000 

GACCCCATGA TCTGCCCGCC 25050 

GTGAGCCACT GCGCCCGGCC 25100 

TCTGCCTGGG CACATTCTTG 25150 

GCATGTGGCA GAAACTCCTG 25200 

ATCCCAGTTT ATGTTACATT 25250 

TTCCCAGCTC CAAAGTTCCC 25300 

ATGCTTGGCA ACCTGGACTC 25350 

ATTCATCCCT TCCTTTCTCA 25400 

ACCCCATTGC AACCTTCATT 25450 

ATAGAATCCT ACGTGGGTCA 25500 

TTTTGCAAAG AACTTGTAAA 25550 

ATTTATTCAA AACATAGAGT 25 600 

AGGATTAAAT GGGAAATGAA 25 650 

TCCTGCTCAA ACTTTTTTTT 25700 

TACCCAGGCT GGAGTGCAGT 25750 

CTCCTGGGCT CATGCAATCT 25800 

CTACACTATG CCTAGCCAAT 25850 

GGCTGTCTCG ATCTCCTGGC 25 900 

AAGTGCTGGG ATTATAGGCA 25 950 

TTTCCAAAGT AAATGAAGTT 2 6000 

ATATCCATAT CCATTGGTTT 2 6050 

AATAGACCCT CATATCTCAG 26100 

CTGGAGTATC TTATTTATAA 26150 

TTTTTTGAGA CAGACGCTTG 26200 

TGATCATGGC TCACTATGGC 26250 

CCTCAGCCTC CCAAGCTGAG 26300 

ATTTTTTTTT TTCCCCATGG 2 6350 

TCTCAAACTC CTGGCCTCAG 2 6400 

CTGGGATTAC AGGCATGAGC 2 6450 

GACTTTGAAT TACAAGTTTT 2 6500 

AATACTGCTG TATGTTTGCT 2 6550 

TGAGAATTGC TGTCTTTCAG 26600 

TTTTCATCAA TGGGTCGCAG 2 6650 

CTTCTAAGAA AGTCCACCTT 2 6700 

TGGTCAGCCT CGAAGAAAGA 2 6750 
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CGGCTAAGAT GCTGAAGAGG TAGGAACTAG AGGATGCAGA ATCACTTTAC 
TTTTCTTCTT TTTCCTTTTG AGACAGAGTC TCACTCTGTC AGCCAGACTG 
GAGTGCAGTG GTACAATCAT GGCTCACTGC AACTTCGACC TCCCAGGCTC 
AAGCAATCCT CCCATCTCAG TCCCACAAAT AGCTGGGACT ACAGGTGCAC 
ATCACCACAC CTGGCTACTT TAAAAAAATT TTTTTGTAGA GATGGGGTCT 
CCCTGTGTTG CCCAGGCTGG TCTCTTGAAT TCCTGTGCTC AAGCCATCCT 
TCCACCTCAG CCTCCCAGAG TGCCAGGATT ACAGGCATGA GCCACCACAC 
CCAGCCACCA CTTTTCTTAA AAAAAAAAAA AGATTCTCTC TGGTAGACAA 
TCCTCAATAG TCCACATGTT ATTAAACAAT CTGCTGCCTG AATACATGAT 
TTACCAAAAA AAGGAAATTT TGACGGGTTC AGAATATCAA GGGATCTGAG 
GCAAATGTCA CCTATGATAA AATTTGCTAT CAAAATTAGG AAGTTTGTGT 
TTACCTGATC CTAAAGCAGT AACCAGCCCA TTTCTAGGGA ATAAAACTCT 
CATGCGTATA TTGTGCATAT ATATGTATTA TATGACTGAG TGATAATAAA 
ATTTTTTTTC TAGCTTCCTG AAGGCTGGTG GAGAAGTGAT TGATTCAGTT 
ACATGGCATC AGTAAGTATG TCTCCTATTC TTAATACTAG GAAAGTAAGG 
CTAGCTTTAT TTATTACCTA GTATTCAAAA AGTTAGTTCA TTTAACTGCC 
AATTGACTGC AGTTCAAATA AGAAACAAAT AGTGTCTCAA GTAGCACTGT 
ACTCCAATTT TAATATTAAT AAAAAAAATT TTAAGTTATT TTAAATAATG 
TAGTGGTTTC TATAAAGATC ACTTTATACA GAAGAACAGT GCCAATTAAC 
CCATGGAACA TATAAGTAGC TAAAACCAAT TGCTTGCCAA AGAACCAGTA 
ACCCAGGAGT ACATGTCCTT GCCACTGTGT TTTTTCAAGA CAGAGTAACT 
GATTTCTAGT TACTTGCATA GAATGGACTC CTCCTCATAA CTCCCTTCCA 
TCTTGGTCTT TCCCTAGTAG AACTTCTACC TTTTTTTAGT AACAGGTGAG 
TGGGAGAGGT AAGAAGGAGA ATAAGGTCAG CAATTAACCT AAAAGCAGAA 
AGTAAAATTT GTTATTTTTT TTCTGAATAT TTTCTGTGTA ATTTAGCTAC 
TATTTGAATG GACGGACTGC TACCAGGGAA GATTTTCTAA ACCCTGATGT 
ATTGGACATT TTTATTTCAT CTGTGCAAAA AGTTTTCCAG GTAATAGTCT 
TTTTAAACTT TTTAATGTAA AACCAGAATC CTTATTTTAT AGTCTAGCTA 
GTTCTAAATT CTATAGGTAT GTATATTTAC ATGTTTTTCT AATTTTAGAG 
AACAAGCACT ATGACTTATC CACTGTTAGT TTTCCCCTTA GCATTGGGTC 
TTACCCCATG TACGTGATTA GAAATTTGAA ATATTTCCAA TAGCCTTTAG 
TAGAATTAAC T C ACATAG AT GATAAGAATG GGTTGGTTCA CTTCATGTTC 
CTTCCACAGC CTACTATTTC AATAAAAGAA AGTTTCCCAA GACCTAAATG 
ACTATGAACA TATTTTATAA CTATATAGGA GGGGTGGGTC TAGGAATACA 
AAGTTTTGAA TGCTGTTAAT CTTCAACACC ACAGTTGAAA CCACAGGTCA 
GCTTTTTTGC AATTACCATG GATACTTTTC TGTTCTATAG GTGGTTGAGA 
GCACCAGGCC TGGCAAGAAG GTCTGGTTAG GAGAAACAAG CTCTGCATAT 
GGAGGCGGAG CGCCCTTGCT ATCCGACACC TTTGCAGCTG GCTTTATGTG 
AGTGAAGCAG CGCTGGCCTT AGGGGTCAGA GTGCAGCTCT TCTCCATCCT 
TCTATTCTGC TGAAATAGCT CCCCAGCCAA AAAGCAGATC AAAGACCGTT 
TCAGTGGCTG AGCCCCAAAA TTCATGCCAG ATTTTGCAAG AAAATGATTT 
ACTAAAGCTT GAGGGACATC TTTAACAAGT GTTCCAAATT AATCACTATA 
AGGATGAATT GTTTCAGAAA TTTTGGCCTT TAATTATGGC CCATAAATAT 
GTCAAGTAGT CCTTACTCTA AAGAAGTACA CTGTAAAAGA ATGCATATAG 
CCGGATATGG TAGTTCCCTG TAATCCCAAT ACTTTGGGAG GCCAAGGTGG 
GAGGATTGCT TGAGCCCAGG AGTTTGAGGC TGCAGTGAGT TATGATGGTG 
CCACTGCACT CTAGACTGGG CAACAGAGTG AGACTGTCTT TTTTTTTCCC 
CTCTGTCACC CAGACTGGAG GGCAGTGGCA CGATCTCACC TCACTGCAAC 
CTCTGCCTCC CGGATTGAAG CGATTCTCCT GCCTCAGCGT CCTGAGTAGC 
TGGGACTACA GGAGTATCAC CGCACTGGGC TAATTTTTGT ATTTTTAGTA 
GAGACGGGGT TTTGACATGT TGCCCAGGCT GGTCTGAAAC CCATGAGCTC 
AAGTGATCTG CCTACCTCAG CCTTCCAAAA TGCTGGGATT ACGGACATGA 
GCTACCACGC CCGGCCACAC CCTGTCTCTT AAAAAAAAAA AAAATGCAAG 
TTAGAGCATA TTACAGCTTT GTCTCTCAGG AGGATACTTA GTGTATGTAG 
CTATAATTCA TAGATTCCCA AGAAGTTTAG AGCCTAAAGT ATGAGGTCCC 
ACCAGAGGGG CTATCATTAA ATTTAAAGAT TTGTTAAATC ATCTCATTGT 
CCAACACCAC AAACTTGATT GCTTTAAAAT ACTGGTTTAG TTACATTTAG 
TAACTCTATT AGTGCTTTTA ATCTATACTG CTATATCCTC ACATTGAGAT 
TTTTTTTCTT TTCTCTTCCA TCTTCATTCT TTTTTCTCTC ATCCTCATTC 
TTATAAGCCT AGAATACATC ACAAATCCTT TATGCCCATG GAAGCAAGAG 
GAATAAAGAA TGGAGATGTT TGTTTTGCCA TTAACTAAAG ATCTGGGGTG 
TCGGGGAGAA GGGGGATAGA GAAGGAGAAG TGGGAAGAGG TGTCCATAAT 
AGCTTAGGTG CAATTCTGCT TATTTTACAT TTTACCCCCG CTGACTGCCA 
CTTTTTCTTC AGCCCTCACA CATTGTTTGT GCAGGGACCT CATAGGACCA 
GGAATTGTCT ATAGAGGTGG GAATTTGTCT CACCCTGAAA GGGATACCTC 
TAGCATGGTA ATAGTCTTCT AGGATTTGTT ATCATATGGA AAGATGTAAA 
GGGAGGGATT CTGCTGCTGC TGCTGCTGCT GCATGCAGTT GCCATTTCAT 
TTAAATGACT TATTTATAAT TGATGACACT TTTCTGGCTT CCTGTTAATT 
CCTCCCTCAA AGATCAATAA ACCAGAACCA GGCATGGTGG CATGCACTTG 
TGGTCCTGTA ACCACCCAAC AGGTTCACCT TGCCTGCTGT CTAGATAGAG 
CCAATTATCA AGACAGGGGA ATTGCAAAGG AGAAAGAGTA ATTTATGCAG 
AGCCAGCTGT GCAGGAGACC AGAGTTTTAT TATTACTCAA ATCAGTCTCC 
CCGAACATTC GAGGATCAGA GCTTTTAAGG ATAATTTGGC CGGTAGGGGC 
TTAGGAAGTG GAGAGTGCTG GTTGGTCAGG TTGGAGATGG AATCACAGGG 
AGTGGAAGTG AGGTTTTCTT GCTGTCTTCT GTTCCTGGAT GGGATGGCAG 
AACTGGTTGG GCCAGATTAC CGGTCTGGGT GGTCTCAAAT GATCCACCCA 
GTTCAGGGTC TGCAAGATAT CTCAAGCACT GATCTTAGGT TTTACAACAG 
TGATGTTATC CCCAGGAACA ATTTGGGGAG GTTCAGACTC TTGGAGCCAG 
AGGCTGCATT ATCCCTAAAC CGTAATCTCT AATGTTGTAG CTAATTTGTT 
i AGGTAGACTT GTCCCCAGGC AAGAAGGGGG TCTTTTCAGA 



26800 

26850 

26900 

26950 

27000 

27050 

27100 

27150 

27200 

27250 

27300 

27350 

27400 

27450 

27500 

27550 

27600 

27650 

27700 

27750 

27800 

27850 

27900 

27950 

28000 

28050 

28100 

28150 

28200 

28250 

28300 

28350 

28400 

28450 

28500 

28550 

28600 

28650 

28700 

28750 

28800 

28850 

28900 

28950 

29000 

29050 

29100 

29150 

29200 

29250 

29300 

29350 

29400 

29450 

29500 

29550 

29600 

29650 

29700 

29750 

29800 

29850 

29900 

29950 

30000 

30050 

30100 

30150 

30200 

30250 

30300 

30350 

30400 

30450 

30500 

30550 

30600 

30650 

30700 

30750 
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AAAGGGCTAT TATCATTTTT GTTTCAGAGT CAAACCATGA ACTGAATTTC 30800 
TTCCCAAAGT TAGTTCAGCC TACACCCAGG AAT GAAGAAG GACAGCTTAA 30850 
AGGTTAGAAG CAAGATGGAG TCAATGAGGT CTGATCTCTT TCACTGTCAT 30 900 
AATTTCCTCA GTTATAATTT TTGCAAAGGC GGTTTCAGTC CCAGCTACTT 30950 
GGGAGGCTGA GACAGGAGGA TTAATGGAGC CCAGGAGTTT GAGGTTGCAG 31000 
AGAGCTATGA TCACGCCACT GCACTCCAGC CTGGGTGACA GAGTGAGACC 31050 
CTGTCTCTAA ATAAATAAAT AAGTAAATAA ATAAATACAT AAATAAAATC 31100 
AAGATGGTGT GCAATTAGAA TTGAGCGATT TTGTTTCCAA ACCTCAAGAA 31150 
AGCTTGGTCT TGCTCTGTCC CAGGTGGCTG GATAAATTGG GCCTGTCAGC 31200 
CCGAATGGGA ATAGAAGTGG TGATGAGGCA AGTATTCTTT GGAGCAGGAA 31250 
ACTACCATTT AGTGGATGAA AACTTCGATC CTTTACCTGT AAGTGACCAT 31300 
TATTTTCCTA ATTCTAGTGG AGTAGATTAA AGTCAACTCA GGACCTCTGG 31350 
TGTTAACCTC CTATGAACAG TCAGTCCTCT CAGTAACTAG CCAAATCATG 31400 
AGATGATGAA TTAGAAGGAG CCTTAGATAG CATCCAATCT AACATTTTTT 31450 
TGTGTGTTTG AAGAGAAGAA ATCAAGAGCT AGGAATAACT TTTTAAAGGT 31500 
AAGCCATTTG CAGTATAGTG TGGATTTTGT TTAAAAGGGG ATAATTTGAA 31550
ATTTTATGAC TCATTATACA AGACAAAATA AGTTGGATTT TCAAATGTTT 31600 
TACAAAGTAA ATCAAAGTTA TAATTGCCTA CAGTACGCAA AGCTTCAAAA 31650
CATTTTTTAT GTTATGAAAT TGTAATTTAT TTAACCTTAA AATGAGCCAG 31700 
TACCATGTGT TTGCTTAAAA ATCTCATGCT AAGAATTTAC TATGTTGTTA 31750 
ATAATCTTCA AGATATTTAT GAATAAAGTC TTATTTCTAA TCCTTCCTCC 31800 
AACTGTATCT GGTGCTAAAT CAGGAAATGT TTCTTCCCAA AAAGCCTCGT 31850 
GGAAGATCTG TATGTCTAAA TATATGTCAG GGATAATACA GATGTAGCCC 31900
TGCGAAGCAT GACCTTGATT TTTATAGTCT AAAATGTCAT TTGCAGATAT 31950 
CTATTTTCTA AGAATAATTC CTAAAAGAAT TATTTGAATG TTGTAGGAAA 32000 
GCTAAGAAAT TTTGCAAAGA GCGTACGTGA AAATATAAGC TAGGCTTTTG 32050 
TGGTTTGTGG ATAGACTTCC CAACAAAATT GCTTTTTATC TATAGTGATC 32100 
CAAGCTTGTG GAACATATTA GTCATCTTTT TTTAGAAAAT TCTTAGAAAA 32150
GTGATCTTGC AAAAATGGAA TTTATCTTTC CCCAAGTATA TTCTGTCATG 32200 
TATAGAGTTA AACTAAGCAT AGTAATTTCA CCAGACAAAC ATTCAAAATC 32250
TACTCCTGAC CTTTTTATCT CATCCAAATT TTCCCAGGGC CCAGACATAA 32300 
ACCTTTGCCT TACGAACTCT TTGTATATGC ACTAAATATG CTTCTCCTTC 32350 
AAGGTTCTCA GTCAGCTAGA AAAATGTGCA AGAGTAAATG GTACCCTTCT 32400
CACTTGTAGA TCCAAGAGAA TTAGACTTAA ACTCACTCTA CATGTCTGTG 32450 
ACTTTATTTT ATTTGCATGA CAGTCCTGTG AGGTGGCAAG GCAGGTATCT 32500 
TGGATCCATT TTTTAGATAA GGAAGTTCAA ATTGAGAAGA GGTTGCATGA 32550
TTTACAGGAA GCCATACTGT AGTCCTATGT TACTCTTAAA AATCCCATTC 32600 
AAATCCTGCT TCTGAGGCCT GCATACTTTC TACCCTACCA GTCATTGACC 32650 
CATGCTTATG TCTCCTTTGA AAACATTGAT TCCACTCTTG TCTCCAGTGA 32700 
AAAAGTGGAA TTTAAGCAGA GAAACAAAAG CCATTTGTCT TGTTAAGTCT 32750 
ACTTTCCCTC TACTTTCAAG AAGGAAAGTT GGGGTATGTG TTGAATGGTG 32800 
ATTTATTTAT TTATTTATTA TTTTAAAAAT TGATACAAGG TCTTACTGTA 32850 
TTGTGCAGGC TGGTCTCAAA CTCCTGGGCT CAAGTGATCA TCCCACCTCA 32900 
GCCTCCCAGT GTTGGGATTA CAGCATGAAC CATTGTGCCC ACCACCGATC 32950
CGCAGTTTTT TAAGAAAAAC TTTTACTATA GAAAATTTTA ATCATATACA 33000 
AAATACAGAG GAAAGTATAT GAACCCACTT TAGGAGACTA GAATATGCCA 33050 
CCCCAAAATA TGCCACTTTG GCATAAGGAT TATTTCGAGC TAAAGGCAAC 33100 
TGGGAAGAAA CACATAGAAG AAAAGTTCTC TGTCCTTCTC CATTTGCCTA 33150 
AAAGCAGGAC ATGAATCTTA AAAGTCCCCC TCCTTCCCTT TCTACCAGGA 33200 
AAAACAAGAG TTAATCACTG AAGATAACTT CAGACCCTTA TCAGTGTAGA 33250 
GATGGCACTA GAAGAATCTA TATTACATAC TCATTTATTT TCCTTCCCAC 33300 
AACTTGCCAC CCCAGAGACT AAAAATCCTT TTCCTTTGTC ATGTCTCTTG 33350 
TCCAAAAATT TGCTCTATAA GCTGGAGTTC TAAGCCACCT CTTTGAGAAT 33400 
TACTTGTTCC CTGGTATTTT CTGTTAACAT ACATGTATTA ATATACATGT 33450 
TAACAAGCTT CTGTTTGTTT TTCTCCTGTT TTCTGTCTTG TTACAGAGGT 33500 
CCATCCCAAC TAAGAACTAA AGAGTAGGAG GAAAATATAA TTTCCTCCTG 33550 
CATACTTTGA TCTTGTTTAA TCCGTAACCC TTCCCACTTT TCACCTCCTA 33600 
CCTATTAGAT TACTTTGAAG CAAATTTCAG ATATATTACT TTATCTATAA 33650 
ATATTTCAGT ATGTGCTAGG TGTGGTGGCT CACACCTGTA ATCCCAACAC 33700 
TTTGGGAAGC TGAGGCAGGA GGATCACTTG AGCCCAGGAG TTCAAGACCA 33750 
GCTACGGCAA CAAAAAATCA AAAACTTATC TGGGCATGGT GGCACATGCC 33800 
TGTGGTCCCA GCTACATGAG AGGCTGAGGC AGGAGGATCG CTTTAGCCCA 33850
GGAGGTTGAG GCTGCAGTAA GCTGCATTCA CACCACTGCA CTCCAGCCTG 33900 
GGTGACAGAG TAAGACCATG TCTCAAAAAA ATACATATTT TAGTATGTAT 33950
CCTTTTTGTA AAAACACAAT ACTTTTATCA TACTTTAAAT AATAACAATA 34000 
ATTCCTTAGT ATCACCAAAT ATTTTGTCAG TGTCTCACAT TTTCCTTATT 34050 
GTCTAAAATA TTGTTGATAG TTATTCAAAT CAGAATCCAA ACAAGGTCCA 34100 
TATATTACAT TTGGTTGACA AGTCTCTTAA GTTTGTTCAT CTTTAAGTTC 34150 
TTCCTCCCTC TCTTTCATCT CTTGTAATTT ATTAATGTGA AAAAACAGGT 34200 
AATTTGTTCT ATAGTATTTC CTACATTATA GAGTTTGCTA CATTTATTCC 34250 
CTATGATATC ATTTAGCATG TTCCTCTGTC CCCTGTGTTT CCTGTAAACT 34300 
GGTAGTTATA CCTAGAAGCT TGAGTTTATT CAGGTTTTTA ATTGTATTTT 34350
TTTTGCAAGA ATTCTTTATT ATCTGCTTCT GGAAGCACAG AATGTCTGGT 34400
TGTGTCTGGT TTTGATCTTG ACAGCTACTG ATGACCATTG CCTAATCCAT 34450 
TACTTTATTG GGGTGGGGGG AATAAGGTTT TAAAATAAAT TTTTTTTAAA 34500
GATTTTTTTA ACTGTTATTT TGAGACAGTG TCTCATTTCG TTTCCCAGGC 34550 
TGGAGTGCAG TGGCACAATC ACGGCTCACT GCAGCCTTGA CCTCCTGGGA 34600
TCAGGTGATC TTCTCACCTC AGCCTCCTGG GTACCTGGAA CTACAGGTGC 34650
ACACCACCAC ACCTGGCTAA TTTTTTGTAT TTTGTGTACA GAAGGGGTTT 34700 
CATCATGTTT CCCAGACTGG TCTTGAACTC CTGGGTTCAA GTGATCTACC 34750
CACTTCAGCT TCCCAAAATC CTGGGATTAC ACTTTGGCCA CCGTGCCTGG 34800

CCTAAATGAA ATTATTTGTC TCTAAACAGA CAGAAGTTTT ACTTTAAAAA 34850 

TTTGTCTTTG TGTGTACATG TGTTTGTGTA TGTGTGTGTG TCTAAAAGTT 34900

TGGCTTTGAG CTTTGCTTTG AATTCTTGGA TGAACAATAA CCAAGAATAC 34950

TTAAACTCTG ATCATTCTTG ACAGATATCC CCTACAGGCT ATGGCCTTTT 35000 

GAATTGTGTC CTCCAGTGAT AAAAAGCAGC AAGCACGATA CTGCTCTCAG 35050 

ATTCATGGTG GTCACATGTG AGGTGAAAAA AAAAAAAAAG ATGAATCCTA 35100 

TTTAAATGCC CCCAGGATAA CAGTGATACT CTTTGTAGGA TAACTATTTG 35150 

CTTGCCACTG GTTTCATTAA ATAAGGACAT AAGTAAAGAT CTATTTTTGT 35200 

CTCTTTCTCC CCAACCACCA CAACTAGGAT TATTGGCTAT CTCTTCTGTT 35250 

CAAGAAATTG GTGGGCACCA AGGTGTTAAT GGCAAGCGTG CAAGGTTCAA 35300 

AGAGAAGGAA GCTTCGAGTA TACCTTCATT GCACAAACAC TGACAAGTAA 35350 

GTATGAAACA CACCCTTTAC CAATCATCAA GTTTTAGTGG GTAAGCCTGT 35400 

AACTTTACTC AAACACC CTG TTGCATGTGT CTATACATTG CATAAGTATA 35450 

GGCAGTTGCA ATTTAGTAAA GTTTTATACA ACGATTTTAT TTTATTTTAT 35500 

TTTTAGAAGA AAAATGCTAC TTTTGTTGTT GTTGTTTTTT GAGACGGGGC 35550 

CTCGCTCGTC ACCCAGGCTG GAGTGCAGTG GTGCAATCTC AGCTCACTGC 35600 

AACCTCCGCC TCCCGGGTTC AAGTGATTCT TGAAGAGGAG AACAATAATA 35650 

ACAACAATAT TATTTTCAAA AGTTGTGACC GCAGTTTCTG GAGTTGAGAA 35700 

GACATCGAGA TTTTTGTAGC CTCATACTCT TGCTTTAGGT AGCAAAAAAT 35750

GTTCCTAAAT CTCAGGAATA TTCTCTAGAT AGGTTTCAAT CTATCATTCC 35800 

TGATAAGATG ATGCTGAAAT ACTAATTCTA GCCAAAAAAG ACCAGCTACC 35850 

ATTTCCGATT GTTGGGGACT GGGAACTCTG GATAGTGAGG ACCCCAGTAG 35900 

GAAGTAGCGA GGGGAATGGT TTGAATGGAT AAATTCATAA AAAATGTCAG 35950

TAGATTTAAT TTTCTTATAC ATTTCAGTCT TTTTATAAGG CTAGGAAAAG 36000 

CCCCTGTTTT TATGGTTTAT AATTTGAATT CACATGAACC CACAAAATTT 36050

GCCTTTTACC TTCCTATGTC TGAAAATGGA TAGTCTGGCT GGCCTCTTAA 36100 

CAACCCAGCT GGCAGAGCTG TGAGGATCTC AGTGTGCTCT AGCCCAGACA 36150

TTGGTAGCAT GAACGGCAAC ATTTTTAATT GTGTTTTCAA AATAGGAGCA 36200 

CACTAGCGGT CTAAAACGAT CATAAAAGAA GGATACTAAG AGGGCCCACT 36250 

GTCATTATGG ATCCTAATAC TTAGGATGCA TTATGGATTG TCATTATGGA 36300 

TACTAATACT TAGGATCACA TTTGTAATTG AGTTTTTAAT TGCTTAAATT 36350 

AGATACATAT TTCTATTAAG TTAACCTCTT TGCTTTTAGT CCAAGGTATA 36400

AAGAAGGAGA TTTAACTCTG TATGCCATAA ACCTCCATAA TGTCACCAAG 36450 

TACTTGCGGT TACCCTATCC TTTTTCTAAC AAGCAAGTGG ATAAATACCT 36500 

TCTAAGACCT TTGGGACCTC ATGGATTACT TTCCAAGTAA GTAATTTTCC 36550 

TTGTTCATTC CAAACTTTCA ATAAATTTAT TGGTGTTTAT CAGAATAGAG 36600 

AGTTTGGACA GGGAGCAAAA GACAAAGTCA ACTATATCAA GTTCTAATAA 36650 

TTCTTAATAT TCAGGAAATT TATGTATGAA TACTTACTAA TATGAGTATA 36700 

ACTCATCCTA AGAGTCTAAA GCAAAAGGAT GTGAACACAA ACTAGCAGTT 36750

ATCTTAGAGA ATAAGTTTGC ATTTCAAAAT AACTTGACAT ATCAAGATCC 36800 

ACTCAACGCA TTTAAATTAT TTACTCTAAA AAGACATAAT TCTTGGTAAC 36850

ACATTCACTA AAGCAAAATA TACCTTTATA TAATTGCTAT CAAAGGTATG 36900 

TGGGTTGGTA TAAAATATCA TACCATGTGA GATCAGTGTG ATTCCTTTAC 36950 

AGCATTAATT TTTATTGGTT AGAGTAAGAA AAAGAATAGC TAGAGTATAT 37000 

TTCTTAAGTA GATTCTCATA CACTTTGGTT TCAAAAACCA ATTATTGACT 37050 

ACATCTTATA AAAGCCTGTA TTCAATGGAG TGCCAAAAAA TGACTATGAG 37100 

TCTTAAAGAG TTAGGCATAT AAATATTTTA AGGTTTCTGT TCAATGTATG 37150 

TTGGAAGGAG TTCCTTTCTC ATGACTATTC TCATATTGGA GCATAAAAAG 37200 

AGTTTACAGG CTTGGCGCAG TGGCTCATGC CTGTAATCCC AATACTTTGG 37250 

GAAGCTGAAG CAGGCAGATC ACTTCAGCCC AGGAGTTTGA GACCAGCCTG 37300 

GGCAATATGG CAAAACTCTC TCTACAAAAT ATACCAAAAT TAGCCAGGCG 37350 

TGGTGGTGCA TGCCTGTAGT CCCAGCTACT TGGGAAGCTG AGGTGGGAGG 37400 

ATTGCTTGAG CCCAGGGGGG TCATGGCTGC AGTGAGCTGT GATGGTGCCT 37450

CTGTCACCCA GCCTGGGTGA CAGAGTGAGA CCCTGTCTCA AAAAAATAAA 37500 

TAAATAAAAA TTAAGAGTTT ACAAAATTCT CACCATCTCC TCCCATCTTT 37550 

GCAAATGCCA CATAAGTGAT GTGTTCCAGG ACTATTAGCC TCGGAACCTG 37600

AGGCAGTACA GTAAGCACGC TTTCTCCAAA GTCCTGTCCC CCACAGACAA 37650

ACATTATTTA CACTGGGTAC TGCTCTTTTA TTTTTTCCCC TCTATGCTTT 37700

ATTTTACTAT AACTATAATC ATATAACATG TAATAGGAAA AAGGCAGGGT 37750 

CGGGGGAGAG ATCCAGAAGT CTTCCCAAGA GCCTTTCCAA CATAGCCTCT 37800 

GTAGACATTT TTTCTTTCTT CTTTTTTTTT TTTTTTTTTT TTCTGAGACA 37850

GAGTCTCACT CTGTTGTCCA GGCTAGAGTG CAGTGGCGTG ATCTAGGCTC 37900

ACTGCAACCT CCGCCTCCTG GGTTCAAGCA ATTCTCCCAC CTCAGCCTCC 37950

CTAGTAGCTG GGATTAGAGG CATGCATCAC CACGCCTGGC TAATTTTTGT 38000 

ATTTTTAGTA GAGATGAGGT TTCACCATGT GGGCCAGGCT GGTCTTGAAC 38050

TCCTGACCTC AAGTGATCCA CCTGCCTTAG CCTCCCAAAG TGCTAGGATT 38100 

ACACGAGTGA GCCACCGTGC CCTGCCCCTA TTACATTCTG ATCACACATT 38150 

TCATGTTTTA TAATTGGAAA ACTGGTGAAA TTATAGACAA TGTTTTGTTC 38200 
CCCTAAATTC TCTTTGATGA GTATATATTA CTTACACTCT TCTGTCTTTA 38250 
AAATTTTGCA AAATAGTATC CTAGATAAGT TTATGAGTGC ACAGTCTGTA 38300
CGCTTACTCA TATTAATGAC CTCGGAGAGT TAAACAACAG TCACCTTTAA 38350 
AAATTATTAC TATCATTATC ATTATTTTTG AGGCGGGGGT CTCATTCTGT 38400 
CTCCCAGGCT GGAGAGTAGT GGTGCGGTCA CAGCTCACTG CAGCCACCGC 38450 
TACCTGGGCT CAAGTGATCC TTCCTCCTCA GCCTTCTGAG TAGCTGAGAC 38500 
CACAGGCTTA TGCTACCACA CCTGGCTAAT TTTTTAACTT TTTGTAGAGA 38550 
CGATGTCTCA TTATGTTGCC CAGGCTGGTC TCAAACTCCT AAGCTCAAGT 38600 
GATCTTCCTC AGCCTCCCAA AGTGCTGGGA TTACAGGCAT GAAAAACTGC 38650 
ACCCAGCCCT AAAAATTATT AGGGTCCTGC ATAGTAAGAC TTTAATAAAT 38700
ATTTAAATGA ACATCTGGTT TTTTTAAAAA AAAAATAGAG ACAAGGTCTC 38750
ACTATATTGC CCAAGCTGGT CTCGAACTCC TGGACTCACG CAATCCTGCT 38800

GCCTTAGCCG CCCAAAGTGC TGGGATTACA GGCATGACCC ACCTCATCTG 38850

GGCTGAGTGA ACATATTTTT AACATAAAGG CCGTATTTTA TATTTATCTC 38900 

ATACATTTTG CCCAGCATCC CCATTTCCGC CGAATCTGTT GCTTGCTAAT 38950 

TCCTTCCAGC TTCATTTCAT CTGAAATTTG ACAAACATCT TCTATTTCTT 39000 

TGTCGTCATG TTATTGACTT CAGAATATAA AATAAAACAC TATACCCAAA 39050 

TTAAACCCCA CCCTCATTGC CCAGCCTGAT GTGAAAATAA TCAGCATACA 39100 

TTAAGCTTAC CCTTGATATA TGTGTAGCAT CTTTTAGATA AATATACAGC 39150 

TGATTAAGCA ATATAGCCTG ATGGTATAAT ATCTTGCCCA TGTACCTCAT 39200 

CTTATCTCCA GCAGGATTAA TTCACAGTGA TCAGATTTAC CTTTAAACTT 39250 

TGTAGCAAAA TATCCTCTCC AAAAGCATAT CTAAAACTTT TGTGTGTACT 39300

CTTGCAAGTT TCTTAATTTC ATGCAGAACA GGCTCTTACC ACTGTTAGCT 39350 

GGAGATATTT TCAAGACCTA TTTTTGTTTG TGGTTTCCTG ATGATGGTCA 39400 

TGGCATTTCC CCCTTCACTC CATCTAAAAA TTGAGGTGAT ACAGGCTTTT 39450 

AAACAAAACC AACTCATATA GACTGAGTAC AACTGCAATG CAGGCATGCT 39500 

AACCTCTGCT ACAATCATGG GCGTGCTATT GATATGTCTT AAGTTACAGA 39550 

ACACAGGGCT GAGCGTCTCA TTAGGTCAAA ATGTAAACCA GTTTTTCTGC 39600 

TCACTGATGC TTAATGAGGA CAGGGTGTGA GAGATTTCTT TAAGGAAAAC 39650 

AAATATATAA TAATGCTACA TGGAAAAATA TCTAACATTA GAGAATTAAG 39700 

TAAATAAACT AATATACTCA CACCATGGAA TCTTGTGCAG ACATTAAAAT 39750 

TATGTAGTGG ATGGATGTTT AATGGTGTGA GAAAAAGTTA GGATGTGCTG 39800

GGGTGGGGGG AAGAATCAAG TTTTAAGAAA ATACAGTATA CCCATACTTA 39850

AGTAAAAAAA AAAAAAAAGG TATGTACAGT CATGTGTTGC TTAATGATGG 39900

GGATACATTC CGAGAAATGT GTCGATAGGT GATTTCATCC TTGTGTGAAC 39950 

ATCATAGAGT GAACTTACAC AAACCTAGAT GGTCTAGCCT ACTATGTATC 40000 

TAGGCTATAT GACTAGCCTG TTGCTCCTAG GCTACAAACC TGTAAAGCAT 40050 

GTTACTGTAG CGAATATACA AATACTTAAC ACAATGGCAA GCTATCATTG 40100

TGTTAAGTAG TTGTGTATCT AAACATATCT AAAACATAGA AAACTAATGT 40150 

GTTGTGCTAC AATGTTACAA TGACTATGAC ATTGCTAGGC AATAGGAATT 40200

ATAATTTTAT CCTTTTATGG AACCACACTT ATATATGCGG TCCATGGTGG 40250 

ACCAAAACAT CCTTATGTGG CATATGACTG TATACATGTA CACAAAAAAT 40300 

AGATGAAAGA ATGAATATAC ATCAAAATAT TTAAAATGGT TATAATGACT 40350

TAGGTTACTT TTATTTATCT TAGTAATAAT AATGATGATA GATAATACTT 40400

TTATAGTGTT TACTATATAA AAGACACTGT TATAAGTGTT CTACATACTT 40450 

TACATGTATT ACCTAAATGA TATAAATATA ACTCTGACAG TAACTAATCT 40500 

TATACGTTCT CTTTTCTTTT TTTTTTTTTT CTTTTTTTAG ACAGAATCTT 40550 

GCTCTACCAG GCTGGAGTGC AGGGTGCAAT CTCGGCTCAC TGCAACCTCC 40600 

GCCTCCCAGG TTCAAACGAT TCTCATGTCT CAGCCTCCTG AGTAGCTGGG 40650 

ACTACAGGCA CACACCACCA TGCCCGGCTA ATTTTTGTAT TTTTGGGTAG 40700 

AGATGGAGTT TTGCCATGTT GGCCAGGCTG ATCTTGAACT CCTGGCCTCA 40750 

AGTGATCTGC CTGCCTCAGC CTCCCAAAGT GCTGGGATTA CAGGTGTGAA 40800 

CCACTGTGCT CGGCCTAATC TTACAAGTTT TCAATATTTA AAGAGTGCTA 40850 

ACTTTGTTGA CAATATAAAA CATATTTGAG AAAAAGAGAT ATAAGCATCT 40900 

TATTTAGAAT TATGAAAATA TCAATAGACC TACAGCCGAC TAAAGCTTTT 40950 

CTTCATAAGC TCTTGCCTAT ATTGATTCGC TCCTGTGAAT ATGCATTAAT 41000 

TTGATTTAAA TAATAAGTAT GTATAAGAAA TAACACTTTT CCTTAATTTT 41050 

TAAGAACGTT CAACAGTTTT TAATTTGAAT TCCAATAGTG AAATACATAG 41100 

AAAATATAAA ATTTTCTGTA GTTTAGCCAA ATTGTTTTTG TTTCACCACA 41150 

GCATTCTACC AAAATTTCTT AATAACAGTA AGAAAATGAA TGCATACCTC 41200 

CTGCAGGGAG AGGGGAGTTA GGCAGTTTAT GGGCATAGTT ACAAGTGAGA 41250 

AATTTCATTG GCTACCATTT ACGCTAAATT CATAAAAACT GCATTCAATT 41300 

CTATATATCT ATTTTCTTTA CATAAAAAAG GTTTCAATTA TTGGCCATTA 41350 

AATAAAATAG CCACCATTCC AGAAGTTGTG TCATGTTTAT CCTTTTTATA 41400 

CCACCATCAT ATTGCCTATT ATATAGATTG TGTGTGTTCC ATTTTCTGTA 41450 

ATGGGCCAGA CAGTAAGTAT TTCTGGCTTT GGAGTCCATA TGGTCTCTAT 41500 

CATAACTACT CATCTCTGCC ATTGTAGCTT AAAGATTATC TAGGTCAAAT 41550 

GCCTAAGTGA TATAGTGTTG AAATACAAGT TATATAATAT AGGCTGCCAC 41600 

AAAAAAAAAT TTATTTGGTC TAAAAAAGAT TTCATGACTT TTGTAGCAGC 41650 

ATGGGTGGGG CATGCACCAC TTGGTTAACT CGGTGTATCT TTCTCCTTTG 41700 

CAGATCTGTC CAACTCAATG GTCTAACTCT AAAGATGGTG GATGATCAAA 41750 

CCTTGCCACC TTTAATGGAA AAACCTCTCC GGCCAGGAAG TTCACTGGGC 41800 

TTGCCAGCTT TCTCATATAG TTTTTTTGTG ATAAGAAATG CCAAAGTTGC 41850

TGCTTGCATC TGAAAATAAA ATATACTAGT CCTGACACTG AATTTTTCAA 41900 

GTATACTAAG AGTAAAGCAA CTCAAGTTAT AGGAAAGGAA GCAGATACCT 41950 
TGCAAAGCAA CTAGTGGGTG CTTGAGAGAC ACTGGGACAC TGTCAGTGCT 42000 
AGATTTAGCA CAGTATTTTG ATCTCGCTAG GTAGAACACT GCTAATAATA 42050 
ATAGCTAATA ATACCTTGTT CCAAATACTG CTTAGCATTT TGCATGTTTT 42100 
ACTTTTATCT AAAGTTTTGT TTTGTTTTAT TATTTATTTA TTTATTTATT 42150 
TTGAGACAGA ATCTCTCTCT GTCACCCAGG CTGGAGTGCC ATGGTGCGAT 42200 
CTTGGCTCAC TGCAACTTTA AGCAATTCTC CTGCCTCAGC TTCCTGAGTA 42250 
GCTGGGATTA TAGGCGTGTG CCACCACGCC CAGCTACTTT CTATATTTTT 42300 
TGTAGAGATG GAGTTTCGCC ATATTGGCCA AGCTGGTCTC GAACTCCTGT 42350 
CCTCGAACTC CTGTCCTCAA GTGATCCACC CGCCTCAGCC TCTCAAAGTG 42400 
CTGGGATTAC AGGTGTGAGC CACCACACCC AGCAGTGTTT TATTTTTGAG 42450 
ACAGGGTATC ATTCTGTTGC CCAGGCTTGA GTGCAGTGGT GCAATCATAG 42500 
ATCACTGCAG CCTTTTAACT CCTGGGCTCA AGTCATCCTC CTGCTTAGCC 42550 
TCCCAAGTAG CTAGGACCAC AGACACATGC CATCACACTT GGCTATTTTT 42600 
AAAAAATTTT TTGTAGAGAT GGGGTCTCGC TATGTTACCC AAACTGGTCC 42650
TGAACTCCTG GACTCAATTG ATCCTCCCAC CTTGGCCTTC CAGGTGCTGG 42700
GATTTCTTTG GGAGTACAGC ATGGTACAGC AGGAGATCAT TTGATGTTAC 42750



CTCTGTGCAG 
GCGATTAGCC 
GGCGCAGTGG 
ATGGATCACC 



GTCCCAGCTA 
GCAGAGGTTG 
ACAGAGAGAG 
ATGCTCCTAA 
TTTAAGAAGG 
TTTTAGAATT 
AAGCATACTG 
TTGCACCTTA 
CTTTGTCAGC 
AAAAAATGGT 
ATTACTATAA 
CCAATACCTT 
CCTTGTTTCA 
TCAAATGGCT 
TCAAACAGGC 
CTTCCCAGCT 
TCTGGTAGCA 
AGCAACCAGA 
TGAATAACTC 
CTCATGTCTT 
TGCCAACAAC 
TGGAGAGGGT
TTTACCAGTG 
TTCAAATGTA 
AATAGCTTTA 
GGTGAAGACT 
ACCCAATAGG 
CTGAGTCTCC 
AGCTAAGCTT 
CCTGAGTTAC 
GCAAAATACA 
TATACCACAT 
TTCCATTCAA 
AAAATTAAAT 
TTTTTATATT 
GTATGACTAT 
ACCACTAGTG 



TGTTGCTAGT 
ACCACAACCA 
CTCACACCTG 
TGACGTGAGG 
CTACTAAAAA 
CTTGGGAGGC 
CAGTGAGCCG 
ATTCCATCTC 
TATGGTCAGG 
TGCTTAGCTG 
CTTTCCTTCA 
CCGTTACCGT 
AAAGACAGCT 
AATAATATGT 
TAATGACAAT 
ATGAAATTGA 



GTTGGGAAGC 
TAAGCAATAA 
CAGGCTCCAG 
CTCTGCTCTG 
TGATGGCTGT 
GGAAGAAAAT 
TTTTTCAGAG 
ATTGTTCAGA 
AACGAGGTTC 
GTTGGTCAGT 
AAAAAATGTA 
TGCCTGTTAT 
GGGGTACACA 



TAATTTTTCA 
AACATCCCTT 
CCACTTATAA 
TTCCCTTAGG 
TTATTCTTCT 
TTTCTTTATC 
TTTCATTCAA 
TTTAGATCTT 
TTCACATTTG 
TCTTTTAGTA 
TGTTGTTTCA 



CAGCGAAAGA 
GTCTTTATTT 
TAATCCTAGC 
AATTTGAGAC 
ATACAAAAAT 
TGGGGCAGGA 
AGATTGTGCC 
AAAAAAACAA 
AAGCAAGGAA 
TATATTTATC 
TGTGCCATCT 
GAAACTGGTT 
AGATTTTGCT 
GAGAGGACAG 
TCAGAGGCGA 
TTTGTCAAGA 
GTTAATGCTT 
TTTTGGCTGC 
GGAAATGTAT 
CACTTCAGTA 
CCATCTTTAG 
AGCTGTTTCA 
GAGCCATTTT 
CTTCTCACAG 
AATGGGTAAT 
CTATAATTGT 
CTACAAACTG 
ATTATTTTCC 
GGATATAGTA 
CTTTTTGCTT 
TGTACTTGTC 
TCCATTACCC 
ATACCACTGT 
GTGAGAACAT 
ATAACAGCCC 
TTATGGCTGA 
CACTTATCAG 
TTTAAGTATA 
TCAATACTCT 
AAATAAAGTA 
ATGTAAAGCC 
CCCCTTGTTA
CTATAATACC TGTGGGGACA
AAAGTTATTA AAAATGGCTG 
ACTTTGGGAG GCCGAGGCAG 
CAGCCTGGCC AACATGGTGA 
TAGCTGGGTG TGGTCCTGTA 
GAATTACTTG AACCCAGGAG 
ACTGCACTCC AGCCTGGGTG 
GTTATTAAAA ATGTATATGA 
GCGAAGGATA TATTATGAGT 
TTTCAAAATG TATTAGAAGA 
CTACAGGCAC CCATCAGAAA
GTAAAAGAGA AACTATCTAT 
GATTTTCTTC TTTCGGTTTT 
ATTGTTAGAT ATGATAGTAT 
GGAGATTCTG TAAACTTAAA 
GGATAAATTT TAGAAAACAC 
GCTTTTTCTC TACCTTTCTT 
AAGTAACAGA AACTCCTAAT 
ATTCCCACAT AACTAGACGT 
CGTCACCAGG GATCTGGGTT 
CGCTGGCTTC ATTCTCAGAC 
TGGGCCCCTT CAAACCTCAT 
TTGAGTCTCC TTCATAGACT 
CAAACCTCTC CTCATGTCTC 
GTGGCCATTT CACCAGTCAC 
CTCTGAGTAA CCCTTTGGAA 
AACACTGCAG TTCTGCGCTT 
CCTCTTAAGG ATTAATATTC 
TCTTTAAAAT TTTTTATTTT 
ACAGGGGTGA ATTGTGTAGT 
ACCTGAGTGA TGTACATTGT 
TCCTTCCGCC CTCTTCCCTT 
GTATGTTCTT GTGTACCTAC 
GCAGTATTTG GTTTTCCATT 
CCAGTTCCGT CCAAGTTGCT 
GTAATAGTCC ATGGTACATA 
TTGATGGACA CTTAGGTTAA 
TTTGTAAGGA GCTAAAGCTG 
TAAATTTTAT ATGTAAGTGG 
ATTTTTATAA CCTTGATATT 
TACAGACTCC TACATTTGGA 
TACTATCAGG ATCCTCGA 



42800 
42850 
42900 
42950 
43000 
43050 
43100 
43150 
43200 
43250 
43300 
43350 
43400 
43450 
43500 
43550 
43600 
43650 
43700 
43750 
43800 
43850 
43900 
43950 
44000 
44050 
44100 



44500 
44550 
44600 
44650 



INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2396

(C) STRANDEDNESS: double

(D) TOPOLOGY:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43

TTTCTAGTTG CTTTTAGCCA ATGTCGGATC AGGTTTTTCA AGCGACAAAG 50
AGATACTGAG ATCCTGGGCA GAGGACATCC TAGCTCGGTC AGATTTGGGC 100
AGGCTCAAGT GACCAGTGTC TTAAGGCAGA AGGGAGTCGG GGTAGGGTCT 150
GGCTGAACCC TCAACCGGGG CTTTTAACTC AGGGTCTAGT CCTGGCGCCA 200
AATGGATGGG ACCTAGAAAA GGTGACAGAG TGCGCAGGAC ACCAGGAAGC 250
TGGTCCCACC CCTGCGCGGC TCCCGGGCGC TCCCTCCCCA GGCCTCCGAG 300
GATCTTGGAT TCTGGCCACC TCCGCACCCT TTGGATGGGT GTGGATGATT 350
TCAAAAGTGG ACGTGACCGC GGCGGAGGGG AAAGCCAGCA CGGAAATGAA 400
AGAGAGCGAG GAGGGGAGGG CGGGGAGGGG AGGGCGCTAG GGAGGGACTC 450
CCGGGAGGGG TGGGAGGGAT GGAGCGCTGT GGGAGGGTAC TGAGTCCTGG 500
CGCCAGAGGC GAAGCAGGAC CGGTTGCAGG GGGCTTGAGC CAGCGCGCCG 550
GCTGCCCCAG CTCTCCCGGC AGCGGGCGGT CCAGCCAGGT GGGATGCTGA 600
GGCTGCTGCT GCTGTGGCTC TGGGGGCCGC TCGGTGCCCT GGCCCAGGGC 650
GCCCCCGCGG GGACCGCGCC GACCGACGAC GTGGTAGACT TGGAGTTTTA 700
CACCAAGCGG CCGCTCCGAA GCGTGAGTCC CTCGTTCCTG TCCATCACCA 750
TCGACGCCAG CCTGGCCACC GACCCGCGCT TCCTCACCTT CCTGGGCTCT 800
CCAAGGCTCC GTGCTCTGGC TAGAGGCTTA TCTCCTGCAT ACTTGAGATT 850
TGGCGGCACA AAGACTGACT TCCTTATTTT TGATCCGGAC AAGGAACCGA 900
CTTCCGAAGA AAGAAGTTAC TGGAAATCTC AAGTCAACCA TGATATTTGC 950
AGGTCTGAGC CGGTCTCTGC TGCGGTGTTG AGGAAACTCC AGGTGGAATG 1000
GCCCTTCCAG GAGCTGTTGC TGCTCCGAGA GCAGTACCAA AAGGAGTTCA 1050
AGAACAGCAC CTACTCAAGA AGCTCAGTGG ACATGCTCTA CAGTTTTGCC 1100
AAGTGCTCGG GGTTAGACCT GATCTTTGGT CTAAATGCGT TACTACGAAC 1150
CCCAGACTTA CGGTGGAACA GCTCCAACGC CCAGCTTCTC CTTGACTACT 1200
GCTCTTCCAA GGGTTATAAC ATCTCCTGGG AACTGGGCAA TGAGCCCAAC 1250
AGTTTCTGGA AGAAAGCTCA CATTCTCATC GATGGGTTGC AGTTAGGAGA 1300
AGACTTTGTG GAGTTGCATA AACTTCTACA AAGGTCAGCT TTCCAAAATG 1350
CAAAACTCTA TGGTCCTGAC ATCGGTCAGC CTCGAGGGAA GACAGTTAAA 1400
CTGCTGAGGA GTTTCCTGAA GGCTGGCGGA GAAGTGATCG ACTCTCTTAC 1450
ATGGCATCAC TATTACTTGA ATGGACGCAT CGCTACCAAA GAAGATTTTC 1500
TGAGCTCTGA TGCGCTGGAC ACTTTTATTC TCTCTGTGCA AAAAATTCTG 1550
AAGGTCACTA AAGAGATCAC ACCTGGCAAG AAGGTCTGGT TGGGAGAGAC 1600
GAGCTCAGCT TACGGTGGCG GTGCACCCTT GCTGTCCAAC ACCTTTGCAG 1650
CTGGCTTTAT GTGGCTGGAT AAATTGGGCC TGTCAGCCCA GATGGGCATA 1700
GAAGTCGTGA TGAGGCAGGT GTTCTTCGGA GCAGGCAACT ACCACTTAGT 1750
GGATGAAAAC TTTGAGCCTT TACCTGATTA CTGGCTCTCT CTTCTGTTCA 1800
AGAAACTGGT AGGTCCCAGG GTGTTACTGT CAAGAGTGAA AGGCCCAGAC 1850
AGGAGCAAAC TCCGAGTGTA TCTCCACTGC ACTAACGTCT ATCACCCACG 1900
ATATCAGGAA GGAGATCTAA CTCTGTATGT CCTGAACCTC CATAATGTCA 1950
CCAAGCACTT GAAGGTACCG CCTCCGTTGT TCAGGAAACC AGTGGATACG 2000
TACCTTCTGA AGCCTTCGGG GCCGGATGGA TTACTTTCCA AATCTGTCCA 2050
ACTGAACGGT CAAATTCTGA AGATGGTGGA TGAGCAGACC CTGCCAGCTT 2100
TGACAGAAAA ACCTCTCCCC GCAGGAAGTG CACTAAGCCT GCCTGCCTTT 2150
TCCTATGGTT TTTTTGTCAT AAGAAATGCC AAAATCGCTG CTTGTATATG 2200
AAAATAAAAG GCATACGGTA CCCCTGAGAC AAAAGCCGAG GGGGGTGTTA 2250
TTCATAAAAC AAAACCCTAG TTTAGGAGGC CACCTCCTTG CCGAGTTCCA 2300
GAGCTTCGGG AGGGTGGGGT ACACTTCAGT ATTACATTCA GTGTGGTGTT 2350
CTCTCTAAGA AGAATACTGC AGGTGGTGAC AGTTAATAGC ACTGTG 2396



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 535 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 
Met Leu Arg Leu Leu Leu Leu Trp Leu Trp Gly Pro Leu Gly Ala 
5 10 15 

Leu Ala Gin Gly Ala Pro Ala Gly Thr Ala Pro Thr Asp Asp Val 
20 25 30 

Val Asp Leu Glu Phe Tyr Thr Lys Arg Pro Leu Arg Ser Val Ser 
35 40 45 

Pro Ser Phe Leu Ser lie Thr He Asp Ala Ser Leu Ala Thr Asp 
50 55 60 

Pro Arg Phe Leu Thr Phe Leu Gly Ser Pro Arg Leu Arg Ala Leu 
65 70 75 

Ala Arg Gly Leu Ser Pro Ala Tyr Leu Arg Phe Gly Gly Thr Lys 
80 85 90 

Thr Asp Phe Leu He Phe Asp Pro Asp Lys Glu Pro Thr Ser Glu 
95 100 105 

Glu Arg Ser Tyr Trp Lys Ser Gin Val Asn His Asp He Cys Arg 
110 115 120 

Ser Glu Pro Val Ser Ala Ala Val Leu Arg Lys Leu Gin Val Glu 
125 130 135 

Trp Pro Phe Gin Glu Leu Leu Leu Leu Arg Glu Gin Tyr Gin Lys 
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Glu Phe Lys Asn Ser Thr Tyr Ser Arg Ser Ser Val Asp Met Leu 

155 160 165 

Tyr Ser Phe Ala Lys Cys Ser Gly Leu Asp Leu lie Phe Gly Leu 

170 175 180 

Asn Ala Leu Leu Arg Thr Pro Asp Leu Arg Trp Asn Ser Ser Asn 

185 190 195 

Ala Gin Leu Leu Leu Asp Tyr Cys Ser Ser Lys Gly Tyr Asn lie 

200 205 210 

Ser Trp Glu Leu Gly Asn Glu Pro Asn Ser Phe Trp Lys Lys Ala 

215 220 225 

His lie Leu lie Asp Gly Leu Gin Leu Gly Glu Asp Phe Val Glu 

230 235 240 

Leu His Lys Leu Leu Gin Arg Ser Ala Phe Gin Asn Ala Lys Leu 

245 250 255 

Tyr Gly Pro Asp lie Gly Gin Pro Arg Gly Lys Thr Val Lys Leu 

260 265 270 

Leu Arg Ser Phe Leu Lys Ala Gly Gly Glu Val lie Asp Ser Leu 

275 280 285 

Thr Trp His His Tyr Tyr Leu Asn Gly Arg lie Ala Thr Lys Glu 

290 295 300 

Asp Phe Leu Ser Ser Asp Ala Leu Asp Thr Phe lie Leu Ser Val 

305 310 315 

Gin Lys lie Leu Lys Val Thr Lys Glu lie Thr Pro Gly Lys Lys 

320 325 330 

Val Trp Leu Gly Glu Thr Ser Ser Ala Tyr Gly Gly Gly Ala Pro 

335 340 345 

Leu Leu Ser Asn Thr Phe Ala Ala Gly Phe Met Trp Leu Asp Lys 

350 355 360 

Leu Gly Leu Ser Ala Gln Met Gly Ile Glu Val Val Met Arg Gln

365 370 375 

Val Phe Phe Gly Ala Gly Asn Tyr His Leu Val Asp Glu Asn Phe 

380 385 390 

Glu Pro Leu Pro Asp Tyr Trp Leu Ser Leu Leu Phe Lys Lys Leu 

395 400 405 

Val Gly Pro Arg Val Leu Leu Ser Arg Val Lys Gly Pro Asp Arg 

410 415 420 

Ser Lys Leu Arg Val Tyr Leu His Cys Thr Asn Val Tyr His Pro 

425 430 435 

Arg Tyr Gln Glu Gly Asp Leu Thr Leu Tyr Val Leu Asn Leu His

440 445 450 

Asn Val Thr Lys His Leu Lys Val Pro Pro Pro Leu Phe Arg Lys 

455 460 465 

Pro Val Asp Thr Tyr Leu Leu Lys Pro Ser Gly Pro Asp Gly Leu 

470 475 480 

Leu Ser Lys Ser Val Gln Leu Asn Gly Gln Ile Leu Lys Met Val

485 490 495

Asp Glu Gln Thr Leu Pro Ala Leu Thr Glu Lys Pro Leu Pro Ala

500 505 510 

Gly Ser Ala Leu Ser Leu Pro Ala Phe Ser Tyr Gly Phe Phe Val 

515 520 525 

Ile Arg Asn Ala Lys Ile Ala Ala Cys Ile

530 535 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2396 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 

TT TCT AGT 

TGC TTT TAG CCA ATG TCG GAT CAG GTT TTT CAA GCG ACA AAG AGA 
TAC TGA GAT CCT GGG CAG AGG ACA TCC TAG CTC GGT CAG ATT TGG 
GCA GGC TCA AGT GAC CAG TGT CTT AAG GCA GAA GGG AGT CGG GGT 
AGG GTC TGG CTG AAC CCT CAA CCG GGG CTT TTA ACT CAG GGT CTA 
GTC CTG GCG CCA AAT GGA TGG GAC CTA GAA AAG GTG ACA GAG TGC 
GCA GGA CAC CAG GAA GCT GGT CCC ACC CCT GCG CGG CTC CCG GGC 
GCT CCC TCC CCA GGC CTC CGA GGA TCT TGG ATT CTG GCC ACC TCC 
GCA CCC TTT GGA TGG GTG TGG ATG ATT TCA AAA GTG GAC GTG ACC 
GCG GCG GAG GGG AAA GCC AGC ACG GAA ATG AAA GAG AGC GAG GAG 
GGG AGG GCG GGG AGG GGA GGG CGC TAG GGA GGG ACT CCC GGG AGG 
GGT GGG AGG GAT GGA GCG CTG TGG GAG GGT ACT GAG TCC TGG CGC 
CAG AGG CGA AGC AGG ACC GGT TGC AGG GGG CTT GAG CCA GCG CGC 
CGG CTG CCC CAG CTC TCC CGG CAG CGG GCG GTC CAG CCA GGT GGG 
ATG CTG AGG CTG CTG CTG CTG TGG CTC TGG GGG CCG CTC GGT GCC 
Met Leu Arg Leu Leu Leu Leu Trp Leu Trp Gly Pro Leu Gly Ala 
5 10 15 

CTG GCC CAG GGC GCC CCC GCG GGG ACC GCG CCG ACC GAC GAC GTG 
Leu Ala Gln Gly Ala Pro Ala Gly Thr Ala Pro Thr Asp Asp Val



GTA GAC TTG GAG TTT TAC ACC AAG CGG CCG CTC CGA AGC GTG AGT 
Val Asp Leu Glu Phe Tyr Thr Lys Arg Pro Leu Arg Ser Val Ser 



CCC TCG TTC CTG TCC ATC ACC ATC GAC GCC AGC CTG GCC ACC GAC 
Pro Ser Phe Leu Ser Ile Thr Ile Asp Ala Ser Leu Ala Thr Asp



CCG CGC TTC CTC ACC TTC CTG GGC TCT CCA AGG CTC CGT GCT CTG 
Pro Arg Phe Leu Thr Phe Leu Gly Ser Pro Arg Leu Arg Ala Leu 



GCT AGA GGC TTA TCT CCT GCA TAC TTG AGA TTT GGC GGC ACA AAG 
Ala Arg Gly Leu Ser Pro Ala Tyr Leu Arg Phe Gly Gly Thr Lys 



ACT GAC TTC CTT ATT TTT GAT CCG GAC AAG GAA CCG ACT TCC GAA 

Thr Asp Phe Leu Ile Phe Asp Pro Asp Lys Glu Pro Thr Ser Glu



GAA AGA AGT TAC TGG AAA TCT CAA GTC AAC CAT GAT ATT TGC AGG 
Glu Arg Ser Tyr Trp Lys Ser Gln Val Asn His Asp Ile Cys Arg
110 115 120

TCT GAG CCG GTC TCT GCT GCG GTG TTG AGG AAA CTC CAG GTG GAA
Ser Glu Pro Val Ser Ala Ala Val Leu Arg Lys Leu Gln Val Glu



TGG CCC TTC CAG GAG CTG TTG CTG CTC CGA GAG CAG TAC CAA AAG 1043
Trp Pro Phe Gln Glu Leu Leu Leu Leu Arg Glu Gln Tyr Gln Lys
140 145 150

GAG TTC AAG AAC AGC ACC TAC TCA AGA AGC TCA GTG GAC ATG CTC 1088
Glu Phe Lys Asn Ser Thr Tyr Ser Arg Ser Ser Val Asp Met Leu
155 160 165

TAC AGT TTT GCC AAG TGC TCG GGG TTA GAC CTG ATC TTT GGT CTA 1133 
Tyr Ser Phe Ala Lys Cys Ser Gly Leu Asp Leu Ile Phe Gly Leu
170 175 180 

AAT GCG TTA CTA CGA ACC CCA GAC TTA CGG TGG AAC AGc TCC AAC 1178 
Asn Ala Leu Leu Arg Thr Pro Asp Leu Arg Trp Asn Ser Ser Asn 
185 190 195 

GCC CAG CTT CTC CTT GAC TAC TGC TCT TCC AAG GGT TAT AAC ATc 1223 
Ala Gln Leu Leu Leu Asp Tyr Cys Ser Ser Lys Gly Tyr Asn Ile
200 205 210 

TCC TGG GAA CTG GGC AAT GAG CCC AAC AGT TTc TGG AAG AAA GCT 1268 
Ser Trp Glu Leu Gly Asn Glu Pro Asn Ser Phe Trp Lys Lys Ala 
215 220 225 

CAC ATT CTC ATC GAT GGG TTG CAG TTA GGA GAA GAC TTT GTG GAG 1313 
His Ile Leu Ile Asp Gly Leu Gln Leu Gly Glu Asp Phe Val Glu
230 235 240

TTG CAT AAA CTT cTA CAA AGG TCA GCT TTC CAA AAT GCA AAA CTC 1358
Leu His Lys Leu Leu Gln Arg Ser Ala Phe Gln Asn Ala Lys Leu
245 250 255

TAT GGT CCT GAC ATC GGT CAG CCT CGA GGG AAG ACA GTT AAA CTG 1403
Tyr Gly Pro Asp Ile Gly Gln Pro Arg Gly Lys Thr Val Lys Leu
260 265 270 

CTG AGG AGT TTC CTG AAG GCT GGC GGA GAA GTG ATC GAC TCT CTT 1448 
Leu Arg Ser Phe Leu Lys Ala Gly Gly Glu Val Ile Asp Ser Leu
275 280 285

ACA TGG CAT CAC TAT TAC TTG AAT GGA CGC ATC GCT ACC AAA GAA 1493
Thr Trp His His Tyr Tyr Leu Asn Gly Arg Ile Ala Thr Lys Glu
290 295 300

GAT TTT CTG AGC TCT GAT GCG CTG GAC ACT TTT ATT CTC TCT GTG 1538
Asp Phe Leu Ser Ser Asp Ala Leu Asp Thr Phe Ile Leu Ser Val
305 310 315

CAA AAA ATT CTG AAG GTC ACT AAA GAG ATC ACA CCT GGC AAG AAG 1583
Gln Lys Ile Leu Lys Val Thr Lys Glu Ile Thr Pro Gly Lys Lys
320 325 330 

GTC TGG TTG GGA GAG ACG AGC TCA GCT TAC GGT GGC GGT GCA CCC 1628 
Val Trp Leu Gly Glu Thr Ser Ser Ala Tyr Gly Gly Gly Ala Pro 
335 340 345 

TTG CTG TCC AAC ACC TTT GCA GCT GGC TTT ATG TGG CTG GAT AAA 1673
Leu Leu Ser Asn Thr Phe Ala Ala Gly Phe Met Trp Leu Asp Lys 
350 355 360 
TTG GGC CTG TCA GCC CAG ATG GGC ATA GAA GTC GTG ATG AGG CAG
Leu Gly Leu Ser Ala Gln Met Gly Ile Glu Val Val Met Arg Gln



GTG TTC TTC GGA GCA GGC AAC TAC CAC TTA GTG GAT GAA AAC TTT 

Val Phe Phe Gly Ala Gly Asn Tyr His Leu Val Asp Glu Asn Phe 

380 385 390 

GAG CCT TTA CCT GAT TAC TGG CTC TCT CTT CTG TTC AAG AAA CTG 

Glu Pro Leu Pro Asp Tyr Trp Leu Ser Leu Leu Phe Lys Lys Leu 



GTA GGT CCC AGG GTG TTA CTG TCA AGA GTG AAA GGC CCA GAC AGG 
Val Gly Pro Arg Val Leu Leu Ser Arg Val Lys Gly Pro Asp Arg 



AGC AAA CTC CGA GTG TAT CTC CAC TGC ACT AAC GTC TAT CAC CCA 
Ser Lys Leu Arg Val Tyr Leu His Cys Thr Asn Val Tyr His Pro 



CGA TAT CAG GAA GGA GAT CTA ACT CTG TAT GTC CTG AAC CTC CAT 

Arg Tyr Gln Glu Gly Asp Leu Thr Leu Tyr Val Leu Asn Leu His

440 445 450 

AAT GTC ACC AAG CAC TTG AAG GTA CCG CCT CCG TTG TTC AGG AAA 
Asn Val Thr Lys His Leu Lys Val Pro Pro Pro Leu Phe Arg Lys 



CCA GTG GAT ACG TAC CTT CTG AAG CCT TCG GGG CCG GAT GGA TTA 
Pro Val Asp Thr Tyr Leu Leu Lys Pro Ser Gly Pro Asp Gly Leu 



CTT TCC AAA TCT GTC CAA CTG AAC GGT CAA ATT CTG AAG ATG GTG 
Leu Ser Lys Ser Val Gln Leu Asn Gly Gln Ile Leu Lys Met Val



GAT GAG CAG ACC CTG CCA GCT TTG ACA GAA AAA CCT CTC CCC GCA
Asp Glu Gln Thr Leu Pro Ala Leu Thr Glu Lys Pro Leu Pro Ala



GGA AGT GCA CTA AGC CTG CCT GCC TTT TCC TAT GGT TTT TTT GTC 

Gly Ser Ala Leu Ser Leu Pro Ala Phe Ser Tyr Gly Phe Phe Val 

515 520 525 

ATA AGA AAT GCC AAA ATC GCT GCT TGT ATA TGA AAA TAA AAG GCA 

Ile Arg Asn Ala Lys Ile Ala Ala Cys Ile



TAC GGT ACC CCT GAG ACA AAA GCC GAG GGG GGT GTT ATT CAT AAA 2258 

ACA AAA CCC TAG TTT AGG AGG CCA CCT CCT TGC CGA GTT CCA GAG 2303 

CTT CGG GAG GGT GGG GTA CAC TTC AGT ATT ACA TTC AGT GTG GTG 2348 

TTC TCT CTA AGA AGA ATA CTG CAG GTG GTG ACA GTT AAT AGC ACT 2393 

GTG 2396 

(2) INFORMATION FOR SEQ ID NO: 46:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 385 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 
CGGCCGCTGC TGCTGCTGTG GCTCTGGGGG CGGCTCCGTG CCCTGACCCA 
AGGCACTCCG GCGGGGACCG CGCCGACCAA AGACGTGGTG GACTTGGAGT 
TTTACACCAA GAGGCTATTC CAAAGCGTGA GTCCCTCGTT CCTGTCCATC 
ACCATCGACG CCAGTCTGGC CACCGACCCT CGGTTCCTCA CCTTCCTGAG 
CTCTCCACGG CTTCGAGCCC TGTCTAGAGG CTTATCTCCT GCGTACTTGA 
GATTTGGCGG CACCAAGACT GACTTCCTTA TTTTTGATCC CAACAACGAA 
CCCACCTCTG AAGAAAGAAG TTACTGGCAA TCTCAAGACA ACAATGATAT
TTGCGGGTCT GACCGGGTCT CCGCTGACGT GTTGA 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47 
AAATCAGGAC ATATCCTTCA CTTATTTGCC TCTTGGTCAT ATTGGAGGCA 
TTTGTATTCA TTTTTAATAA CCCTCAAAAT AGTGCATGCA AAGTGCTAAG 
CGTCATTTGC CACATGGTGC CATTAACTGT CACCACCTGC AGTGGTCTAC
TTAGAGAACA CCGCACTGGA TGTTAACACT GAAGCGCGTG CCCCGCCCTC 
CCGAGGCTCT GGATCCAGCG TTGAAGCTTG CCCCGCCCTC CCGAGGCTCT 
GGATCCAGCA CTGGAGCATG CCCCGCCCTC CCGAGGCTCT GGAGCTTGCT 
AAGGAGTCCG CTCCCTACCG CTGGGGTTTT GCTTTATTCT TATGAATGAC 
ACCCCTGACC GCTTTCGTCT CAGGGGTACT GTAATGCCTT TTATTTTCAT 
ATACAAGCTG CGATTTTGGC ATTTCTTATG ACAAAAAACC CATAGGAAAA 
GGCGGGCACG CTTAGTGAGC TTCCTGCGGG GAGAGGTTTT TCTGTTAGAG 
CTGGCANGGT CTGCTCATCG ACCATCTTCA GGCCTCGTGC C 



